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THE BASIC UNIT IN MEANINGFUL LEARNING.. 

ASSOCIATION OR PRINCIPLE? 

(A Set«Function Language) 

Joseph H* Scandura 
University of Pennsylvania 

Theoretical development in educational psychology has been extremely 
slow* A major reason has been the lack of an explicit language with which 
to discuss research on meaningful learning and teaching* As McDonald (1964(jr 
542) has put it •****occBsional attempts to make a confusing conceptual 
formulation understandable are not sufficient* Conceptual clarity means (a) 
sp^ ficationf stated in terms as nearly operational as possible! of the be* 
havior involved In a task or methodi (b) some delineation of the range of 
phenomena included and excluded; and (c) precise description of the appropri* 
ate tests*” 

Stating research objectives and defining variables in unambiguous terms 
is not sufficient* The teaching-leaming process has all too frequently been 
studied in terms of traditionally defined categories* Much research has been 
of a fragmentary nature; similarities and essential differences have gone un* 
detected* The variables chosen need to have general relevsncCf not be inex* 
tricably related to the question at hand* Without this eharacterlstio« re* 
search findings » almost of necessity remain isolated* 

To provide a substantive base for their researchf educational psyoholo* 
gists have frequently resorted to the languages! paradigms! and theories of 
the mother science of psychology* Mediational elaborations of the S*R Ian* 
guage! operant conditioning paradigms! and more general! but less well sped* 
fiedt cognitive theories have been popular* 



Bach approach has important Xlnitations* one point of viev^ par * 

slBony suggests that the properties of overt S-R associations should also 
be attributed to mediational links. Yet, piractlce has shown that nsedlatlonal * 
interpretations become increasingly cuosbersone and less precise as situations 
become more complex. Sizoilar difficulties have plagued researchers who have 
used opexant techniques to study meaningful verbal learning. The leaults 
simply are no where near as clear in complex human learning as they are in 
the "Skinner Box." It is increasingly recognized^ for exaiz^le^ that know- 
ledge of 3 result 8 Is not directly analogous to feeding a pigeon and that^ in 
any caae^ other factors^ such as subject laatter stnicture, are probably of 
greater importance in pronoting efficient learning (e.g., Bruner, 196O; Gagn^, 
1961}. A general limitation of cognitive theories is their relative impreci- 
sion. Ti^ically, 'cognitions" are either not clearly specified in observable 
te ^ms or are only partially defined.^ 

A most loportant outcome of recent collaborative efforts between educa- ‘ 
tlonal pc^cboXogists and subject matter specialists has been to focus atten- 
tion on the close relationships between task and method variables. Subject 
matter educators and psychologists have increasingly come to realize that re- 
search on meaningful learning and teaching must, on the one hand, deal with 
observables (i.e. behavior) and, on the other hand, with subject matter struc- 
ture. Little of scientific and practical value to education can be accomplished 

■^Chese criticisms in no way deny the importance of existent psychological 
theories. The S-R theories have proven invaluable in dealing with simple learn- 
ing while cognitive theories provide much needed structure and explanatory mech- 
anisms for extremely coDiplex phenomena. 
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by talking about either unobservable mental processes or structure-free (l,e, 
rote) materials. There is need for a precise language^ couched in observables^ 

. vhlch also provides for the description of psychologically releviamt subject 
matter characteristics » 

A Set-l*unction IjMaguage (SFL) has been devised to meet; this need. By dis- 
regarding certain of the subtleties involved in simpler fonas of learning^ vhilo 
representing gross characteristics, the 3 FL makes it possible to deal with many 
questions relating to meaningful learning and teaching in a precise manner. 

Ibis language not only provides a symbolic means of describing certain research 
problems, but makes it possible to consider structural and behavioral questions 
simultaneously. 

There are several vays in which the SFL can be used to make research on 
teaching and learning more explicit. Hrst. learning. objectives can be well de- 
fined (Gagne, 1965). Differences in learning type can symbolically be represent- 
ed. Second, distinctions can be made between presented information. It is pos- 
sible, for example, to specify the difference between presenting a principle 
(i.e. rule) directly as opposed to presenting an instance of the principle. 

Third, the ia^wrtant question of "what is learned*' can be pointedly discussed. 

The sort of stage theory of instuction and problem solving, to which Qegne"^ 

1964) alludes, ultimately will depend on the ability to assess learning type. 

' « 

After establishing the acquisition of specific knowledge, the question always 
mains as to whether the learning was rote 03.* meaningful. Fourth, predictions can 
be stated in unequivocable terms i/hicb relate to the other distinctions made. 

Only by being able to discuss z*elationships between inputs, prior knowledge, and 
the criterion task can research, on meaningful learning and teaching, e:qpect to 
make substantial progress. 

o 
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In addj,tlon^ the SPIi can h a nd le relatively cccsdex altuatlons vitn the 
precision of the S-R inediation language without beccoiing cusobersosne. . Appllca* 
tlon of the SFL to persistent problems has not only provided clarification, by 
abbreviating arguments, but has helped suggest n<b? questions and reformulate 
others. 

• ' • 

In this paper, the SiL is described, its relationship to the S-R Mediation 
language is explicated, a methodology is presented for assessing vhat is leanMid# 
and a variety of research questions are (x%}fonmiLated in texns of the SJIi and; 
the related assessment methodology ^ 



SJOT-FUWC^nOW lAIKWAOE 

* * 

The essential diiffexence between the S«K aaedlatlon language and the 8]^ 
is that the ftgooclRtlgn is the basic unit In the foimer whereas the SPL aahas . 
central the notion of a. principle.. Principles are syttboUcaJly represented 
as functions, f, which »atheaiatici3a,ly are sets of ordered S, R pairs of a par- 
ticular type.^ Responses are normally symbolized, f(s), so as to indicate 
their functional dependence on the stimuli. 

In Table 1, S-R mediation and set-function fonaulatlons of basic learning 
types are contrasted. The mediation formlatioas are ail in teras of S-R 



Insert Table 1 about here. 



associations; the SP1» fornBalatlone are all based on the notion of a funetlcn 
(l.e. principile). In each case, principles as rules of the forfe "If A,, 
then B" where AandBaayornaynotbe cc«icepts (c.f. Gagne, is6k), Aecordll^ . 
to the SFCi, a concept is a "degenerate" principle, one in which there is a. 
coataon response (i.e, the function takes oh only one value throughout its 
stlmaus domain). A discrete 8-R pair Is simply a one-element principle. ^ v . 



I 

Only those sets of ordered pairs, which satisfy the mathematical definition 
^ considered here. A set of ordered stimulus-response pairs is 

a function if and only if each stimulus member is paired with exactly one response 

learning types are easy to come by (c.f. Gagn/. 
scheme proposed here, however. Is not based on the 
assiOTption uhat the more complex forms of learning- (e.g. principles and concepts) 
on stopler typeb (e.g. associations) as was Gagn8»s (1964)* svstem. ^re 
is said about the dependence of one principle on another in a later section. 
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TabXe-l 







Multi-Stage 



Type andilDeflcrlptlon 



iMediatioa 



Set-Function 



Simple Chain- A chain of 
diacrete stimulue-re^onse 
connections 



^H«K»iain of Principles . 
A chain of xiaes 



S-y«-R 



.where 



g*f"jts,g*f(s) ]| 

e *H(S,f(g)]l 

are independently determinable 



where . jtS^,fj(S^)) | i«l,.... 



and J«lj 



for • a *Bi 



and «« |[fj»g(fj)l I 

are independently observable rules 



^ The mediation schemata have been sinplii'led by letting the symbols rsg 

represent both the medlatiag response and the response produced stimulus» 

** The mediation formulation shown is one pessiblc way rf modifying the S- 

aSS!!S*»*« represent the notion of s principle. Gagn/ (1964) has 
orrerea a non S-R schema* 

ways of csmbining rules, only s set- function forsw- . 
caralex**^ one-type is given. Any itsdiation representation would be extrewi 
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feBifUln «4ditionai coi«aetot* on the elB^'-stsae.senidiigpB in'teble l are Ja’ 

’ • . . t ^ * • , 

* ' *..*.*-' . . .* 

ortfi:^. yirit- coiuslder the deflnitton. of; a» eoacejt, ...1» eonie situatio^e^ ' 

' .**“**.' ‘ ^ *, * * 

that is required it for each of several itiawii & he ipalred vith a comaon ; 

• « • M » # '*# 4 M ^ , * , * * 

response, ^or other experlfikenlil'purj^ the stisull ssurt be joulti-dlM- . 
sioned. R-ithar than repJTeseniiiig such dlffeseftc^ii;*^ ui^. on«;^ tvo sta^s.^; , 
S-R association jsaradigBa, the distinctloffli is loade in the by considering 

sets of ordered lalrs in vhlch the nuitor of different fimctl-»s yariea. Vhen 

• . . . ' 

S 4 _ ■ ; ■ V • - 

the stiiBuli are discrete, different rules are Involved, but. the response la, ,4 
identical. The rules might be, "If (the stliwlua is) % then (the response Ji -; 
dog; If blun, then dog; tW stimuli are multl^dlasnsionid a ‘^v . 

Single rule Bsy suffice — e.g". *‘2f large, tlSn'dogi** ' 

■■ ' • ‘ . . ' ’ ' “ ' **.^*“‘\* 

Another point of discus^on involves the complexity of ah association . 

representation of a principle,, At Xedst two. sets or mediating stlaiOLi and; re- 

■ • • • 

ttponses (rs) need to be postulated in order to capture the essence a 

'• * * •*'*.- 

.pLe of the fora, *'If (the, stiisulus ii) ' large; , then (the r;e8pon8e is) the nails' ; 
of the stimulus color,” The i^iatOrs rf*^ and rs>’ in Table 1 represem the 

« 4 ^ f ' ' »*, »»• 

relationship, between the S-R^pai^ coap^alDg such principle. It is cei'- « 
teinly reasonable to thiafe of. the stliwll Xla3«te colored objects) as eliciting 
a coHKon response >•* (large), of rs* eliciting rs** (color), and of rs** v 
eliciting each of the responses (color nsaes)* This breakdown does not, however, 
indicate why the responses belong to their respe^tlye stimuli, ' Tbc second set of 
iuediatersi the f s'; i*l,...,n;| serve this purpose.'** ■- • 

I ^ 1 i 



I w e 



^It is important to note that the r^ albne', although they tie the corres- 
ponding stimuli and respobses together, fftil to represent the relationship be^, , 
'tween the pairs. • Such reldtioesshlpc are' crlti'bal to the SjFL approach becauad 
they provide the basis for the assessment methodolo^ described in the n^ 
tlon. 
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Rccogni*ing the dlfficuities involve^ Gdgiae’e (i5<54) original repreaan- 
tatiofi of principle learning did not nsc the S-R language^ but rules of the 
form, "If A, then B,”- where A and B are concepts. He represented concepts 
slsQle forms of learning in S-H terns* Because of the contixmlty break between ; 
the. concept (S*R) and principle (rule) schemata in Oagoe’s classification sys- 
tea (1964), Tracy ifendler (1964, 322-323) raised the question of whether neW 
properties emerge at the principle level. The SFL formu^tion shown In Table 1 
suggests not only that new properties, emerge, but timt these properties can •. 

form a basis for simpler fonas of learning as well. 

The crucij^ argument in favor of using the rule, rather than the assocla- ■ 
tlon as the basic behavioral unit is that of 'simplicity. S*R representaticfni.. 

■ * • ' ' . ' ‘"■i 

• of principle learning {let alone the lecuming of several principlels related la 
various ways) are cuBd>erscme .sjcd are not likely to be useful in dealing with 

research questions on complex leamixig and teaching. 

One vay of ceflbining principles (i.e. "discrete polrs, conceipts, or principles; 

is ty chaining. A slaple chain lilte "«*ay-naiv«i»ii«'' «•« '>* Jrepyesented, . • 

. . **..*'' * * . • 
, (3,g-f(S)], where ls,f(a>} and [#(S),.g.f(8)] correipond to "awsr-navy" and .• 

■ ... . . ./ ■ 

"navy-sailor," respectively. ' 'v.,.-., 

Fomilatlng a vsrbai chain in' the 8fe:%ae^(l!ts a n*f dlre«ion for r«- 

■ ■ . ■ ' ‘ ' { ■ 

search in that area. Ihe principles, f and eottld' , ; 

• pair*. On the other haai, f night be of 1^ forlB,\?!:tt. tJia worf-itlatulos, refew* ,• 
to one end of a sca3.e, then give the word which refers to the other end" or •: . ; 
"If the stimulus refers to land, then give an anklogqiis. word which refers to 

► . • ' ‘ ’ * .• . ’ .V 

water." Thus, f might ccaisist of the pairs, (ariiy^n^), (hot, cold), (tall, ^ . 

. Kherb),. . . or the polrs /.(onw, n^yy), .(alrscrt/'toc^n carrier), (don- ; 
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tinent, ocean), , ♦ . . The principle, g, also could have any one of several 
referents. If the operating principle could be identified prior to learning 
a test list, rather specific predictions could be made. The a8sessKai.t nethr 
odology described in a lat er section would have relevance in this regard. 
Another possibility is that the principle referents themselves are manipulable 
(e.g. by experimental instructions). 

What appear to be still more complex situations often yield to SFL analy- 
ses. Consider a task like stating a rule for finding the sum of particular 
arithmetic series. The initiating stimuli are the series, the mediating re- 
sponses are particular series types (e.g. those series consisting of consecu- 
tive odd numbers beginning with one) and the responses are the formulas, that 
can be used to find the sum of any series of that form. The task may be repre- 

••Y 

sented by j^j^gif where f is the rule, ”If the series is arithmetic (l.e. there 
Is a common difference between terms), then the type of series depends on the 
first term, the last term , and the magnitude of the connnon difference," and 

respectively, "If 'odd,' then "If *even, ' then 
In effect, classical learning, types can be represented in the SFL by 
specific kinds of principle and chains of principles. 



^ere is a one-stage formulation of this situation in which the stimuli 
again are arithmetic series and the commoii response is the general formula, 

(a + l)n 

for finding the sum of any arithmetic series. 




mmmz rbxatzo^hii^ 



A icheme la presented in Figure 1 for claussifying relationships between 
principles involving stimuli with the saiae dimensions. To illustrate each of 
these relationships consider a set of stimuli in which the attributes are slse^ 




OVEBIAPKtKG 

yig, 1. lihree types of relationships between principles. Each circular region 
represents an identifiable principle. 



color> and shape. Let the two principles be ”If large, then color” and "If 
small, then shape.” Since it la impossible for an object to be both large 
and small these principles have no instances in camon and ore said to be 
discrete . The principles, "If tricinglu, then color” and "If large, then color" 
have some instances in common, those with iiarge triangular stimill. Each prin- 
ciple also has additional instances of its own — e.g. small triangles and 
large circles, respectively. Such principles seem adequately described as 
overlapping . The principle, "If triangle, then color” is aore general than 
the pri.icxp3.e, "If triangle and large, then color" since tilia former includes 
ail instances of the latter plus some of its own. Such principles are ordered . 
Generality.- Scandura, Woodward? and Lee (1965) have shovm that v.he behavior 
.Induced by presenting statements of ordered p.rinclples conforms ejqpectaticns . 
Two eaqperiments were conducted, the Independent voriab3.e in both cai^es being 
princ iple generality - 

In the first experiment, each group of 1? college Ss was presented with 
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illiist.T&ti-'.re €i>:anp'.e v^i V}?A-ied iiub'p6*r.d6rirJ.y cf pr3ncii>l'2 gensrality. 

'[•he pattern of res'C’.ts .In thi^j C3tp:si*5.n-j':Xit wa-; a most idcnlical to that -tn 
the preceding of ohe tioiv^id o. problem bsiyo'od the scope 

of thf; pr:'.v.c5.ple jjfe{;«c:.“v:.ed;. viheth^r an c:^amp.:.e ■:. or i:ar.; not Fcviir 

o-’’ the 19 3i-:*Jv;h-<'-.rti^‘5Jn.‘:.c l5s^ b/nr<.:V'.ir,5 ^sc.lvod -.j.-.e gener-^al Trroblem 

(tte-ij) IB solv.d h; aac. :U so?.r^ prn«'dti£i t-wo (l><-CC4). 

vfas atr-ihut'ii to 'i concept .i.:l difi>r?;jn';e in i {j/imsm req>)ired for 
detexiaiciii?.^ ’the riuiiber of tar-i’S in ths r.hii’d prob:ie;. seriec and in the 
oth?srs;. incl'ivling the eiiang 7 . 1 s.. 

KOaiethsless^ these .results certnilnly j’d.ctify the ordering of verbal^.^/ 
presented prinoiplr'S as to .-i^enera-Ut.y.i inter- 

fering exanxlea a^'f; not use'h ~ 

ftb&tractne. 5 ; 3 .- 1%.- addition tt' tr?nt-:f2r poi 2 EAiai> &osm. o.?. the principles 



» Mt«M* vr<rvM* 



differed ?.s. to l^jarnabil.ity. S iMslu.ikt f.r»perimerxt ono. -was signifi- 
■iantly e&f;ier to .leB-ynj as ;Judged by pi-ctleK- perfor^^;i'.’. than were 

};ne rulfei. i'.C «;nd G- (p<'->(?OT m bv7tb c^s^j- 0* esi^-env. allj/- no dif- 

ference in Ji?a-/:TO.r5j, ruios’ ‘X- .ind Gv U<e t at.lcipated f.ind:.n^ ^‘-dlfferenoe.1 
due U gcr^'rality, ■ t€!tv<;:fcn ' ho latter groiips as wfai ancle crit;.7a.U:y;. we 
•were temp ted to jS/ttribut-e tb^'? Xat-i; c>f such ^sffect to 'jcaa.^^ :is-?nsiti.y« ? 
.ity in the i{:;^Kr;r riirigejLw 

Fm-'thor ariclyfis vf t4ie .rdtuaticur hcw”'er,j> IniUe^ted :h?it 
3j£iy ha.ve diffa::’erl sxi to '?wo posEible e5q.danntioxn: 

siigges'ced Cr-.e;Kuialvf s 7'he ‘r^les SG suid G Wftie both .’3Jor«D 

-cU'At there -/re^s «core t:- and cKire abstract, in tlia;t centre 

7endin<.x a ;.-ule for dist^vj'i^ii'nlr.g u nuJAber }ias sy^rs prc*. 

l'V3s thau 0'3:ap:’.?.hfcndin;g ,TOec.ir''’.c 






\ci ;?hould be noted thu.t 'olu;: rules were not stat?id in a font whicr; 
.'fOiiid ariow thy Ss to discr/veUm^ie between, psobleass w!iere the rule ms 
iind was Aiot abprooriate. In hIX of th '2 rules •were of the fona, 

■’tnere is a pitt'^k to tht3 ^/fden will snaole you to win •whenever you 
mro illowtfd to r;\ake t.jB so.’unrnion . Xov. must, hewever, /nai:e mi ap- 

pro^rtate fi-r.st ielaollvi; and 'omn proceed according to s specific patt-ern. 

Thia -procedure m-f-ie in; p:sr.:-ble be A'JiJO obtaii^ Informticn on response 
con8l“sbenc'/ ^t'ol.ch is i:l6cur»iicd in a lato.r ssc.tion (.is£>e6Sirient Methodo3.ogy J < 
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Xri esipiriruei'it i’;v *s:^d 3 3 3a fe'iv,- mKi fr 

•^r^j.y tbe f-ft (<0^\ lO) Fi’<irC<iy^ 

♦Its %r/5re surj-Tini^d tUut uxIy ::■ ox- '.hv- ?:$j,- i iiv}3«."Ut(?;-'?."j¥lf;«. nL.<n.. 5'^^ -f: 50 > 

uw tlwi coilTsirt i:>;a:i (rt'3<')0j -;'a th« a'r.r-s'^ !i’he f\‘/^i. tfe.ti .i!iJ- 

uv- estcfept. tl:,c<fc‘<i i'i, ^•h.s *;i‘,-f-, ’r;t> S vv.^:i.‘33 ^ls;^;£ViT?3'3i.3’' c:$;p}..Ud tho 

ru3s to «.acn ( ‘-./.cirthnvv epp.rc-p.! :.^;.t^.: or nr;t)- huvrmt^T, g&i.ns ua 

«- f:lu«. Khei^j?,p. all 3 (T) or t4''.e bG- (G) |sS;. :.ix ’tvO) ^ho unaiL 

t'hf* rv.lt' fcs.v.ght or; {lol’.pjn or/-- r.a;? I "Jja wo ■*^:iiti. t/arcja 

c-rl,y cue cf sl:j'nt Be in th 4 > -, ,■;r^y:^p did ;io. li^r..r><i Sn ajn;i-5:reni.ly rcf’io^'nlasd 
the liui)j3ciioabi'.il-y Oa tie ru.i-.tJ. ia:-3.c-i t.c- r<in]>r4?.'i/" Sixoli relTU;i:flv.vO«5 

li^Ar'i'. 'inean ev; ‘.'n ;.U“ori ;'s o the {5a nxid f>c.ru« 

** ** 

tlu:lv:.1.n^j the f: :r‘.t papt.-.i’is ..'is.-', thic a f${;v.j;;.l'le iritPir 3 .iJ’?it;:::.tion. 

?l7:;at we :=>.;•■ led to do v^j-lc. ‘.jhat :'i l-i- -lone ij to deal c::-5?lloJ,tly 

v:-jth sifiteiien: <;asp.l?:c,i;vv' &jic. c- .h3trM uJ.t i , It wovld be ex- 

tremely de.^i;f'db.lo t,o ?.vr;e sera-.* ;iv.;an&i x;' ” ’’eo.liTiiiif.iiyl'ig t>iese che.rac'i:eT.jL ' 5 «* 
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In ^siy .:*aae,j I ro.«:- that rr-lo.x'* xoii-xiins: Jesg, Gt' 4 ;iiS;. 1962 j Seandura? 
19*!>5»ts,;- hi; l:J;oli- to bi- a cnioi.a?, ''?u;tsyr.. It iaajrj in iHot? :Ih\- 

po-.ss.lh.le TaO {j;-.ve C; r»fhJAc.o:\,ai. 3 ’ n€.ar.i.'.^}6;-:- tl of .-yither at,eqUsKlb'/ 

or al:-?>tr'av;t.vii‘.r3r; ^iilLou*. e:-i:p3..h‘;lo rfif ji.'eim': to prersqulriite knc«tledgn<. 

“\A.7x"ior/u cll oi: ’:.-i< . ve j_l=t '■n;.evn t’x batfup;!/-i V'J:-'i‘,o istlil 8in.c*r 
'jor'.Pri'! cii-je .Icler>t:,.a3-1 v.i th :V tT.:.o rnr^- t,h.« cajio in b 3 <pa:riKife b;Tie 

ir» wfeoh st‘U. o.t the | a w..;;i*!v ox" ic a;, i:a!r}•.^y^ 5 ^.c of ths {^, 31 ) 

"hi-j'bii riftnj'’ mviarita of 

.t oau pcbeihlj- lie tia.:; mc-'t ,;,';;r.i.oj' high school. Bs 'f/sinild 

i-h'. t;x .1 • :> - •>•• 4 - oo md 1 +'3 4* 49 
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Since I have gone this far, let me go a little further and suggest 
a behavicrally relevemt definition of abstractness * One statement of a 
pr^ciple is more abstract than another if the set of prerequisite prin- 
ciples of the former properly contains the corresponding set of the 
latter. That is, the more abstract the principle statement, the more 
prior knowledge is required. According to this definition, *'lf (given 
an) odd series, then the sum of the series la obtained by cooputing 
Inhere N is the number of terms In the series, is more abstract than 
"If odd series, then the sum of the series is obtained by multiplyizig 
the number of terms in the series by itself. " Being able to give the 
appropriate product when told to compute where N « specific inte- 
ger, is prerequisite to applying the rule when stated in the former 
manner. The latter statement makes no such requirement. 

According to this definition, the representations of i particular 
principle can only be partially ordered according to abstractness, 
figure 2 indicates how various representations of the principle, 
may be placed on an abstraction lattice. The middle representation 
on the left and the one on the right are not directly comparable. In. 
the foxmei“ case, S n^eds to understand the symbolism (i.e. N X N) 
and, in the latter case, S needs to know how to find the numb r of 
terms in an edd arithmetic series. Some of otir recent experiments 
(Sesndura, 1965a, 1965b) suggest that unless S can pperational'ly 
make use of a tern, statement, or rule; an explanation of a more cos- 
pXicated rule using these notions is essentially not understandU\{bXe 
(i.e. can not be used effectively). 
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_jBbfttractnei6 




I Find N; ccupite • 






Find N; conspute N x N 






Explain hoir get N; confute H x » | 






tit. a. A Ml® for oWi*tain« the Mp of oW eafithpetit eerie* Breaented 

at various levels of ehstrMtion.- The sietemente "Eepliln htw get H" 

la ■ and ”coiBpute H x K” correspond tOy^etate^nft8> in * ct^ttioon place 

t*r«»^'''finding B” and "ccerputing BS," respectively. 

A 



Coa1unctiQP^Dis.1unctloa a - So. fur,, reference has l^en aadt only to 
junctlve principicfl, principles of the fontt> "If A and B> 'hen C® As 
with concepts (e.g® Bruner, Goodnovr, and Austin, 195^)^ disjunctive 
principles are of the fora, "If A or B, then C» Thte set ptoperties 
' of principles, hoirever, indicate that every disjunctive priiciple e\m 
he expressed as tlie union of two or more conjunctive principles with 
a coBtRon "then" clause* Thus, the union of the principles, "If- A, thei:^ 

' C,” and "If B, then C," includes exactly the same instances m does the 

disjunctive principle, "If A or B, then . . 

In an entirely- analipm fashion, disjunctive conceits Bruner 









f 



IT ■( t W i$, iU 1**^ ^ 



. 1956) corre^)OSicl to two or co.tsjtittct^.'vfe c0^2ept» wit)^.i> 

ccoBKH response. Wlyasi looked s,t in this It %n cot har&lte 

' < . • • , / 

see \ihy di«,j\mct.ive concepts ?.Ye harder to .iemi ecsijonctive 
concepts o There are ]^:^ 'cor.icepts to Xeam s.t ‘^KSCT'be harder -to 
iearxx two or tiore concents than it is one»^ • ‘ ,-. 



■%yjnbolic logic can' be used to ayi&bolize the relationship^^ involved 
in conjunctive and disjuactl’/e principles. . For exan^pltv principles of 
the form^ "If A and then C and P, " "’If A or % then C,” and "If A, 
then 3 or C#" may be symbolij&id as: '' 



( 1 ) 

( 2 ) 

D) 



(A ^ B) ,{C • D), 

• . •» 

(A V B) ajid 

A (3 V C)-, respectively. 



^ This use. of the statement logic ir> similar to th&t by Andersjjn, 
(19^) to anal^-so S-R mediation thebrv, The l/ett-< 5 ri|.A> JO/ koA P/ 
iaay refer to stimulus cues and dimcn 6 ior 4 c\t ‘to nooinal' stimli and 
responses. We dejiy t-hc2 n<^®d to consider principXs'ii of the form;, “"j 
{see footnote p.. 21). It is VH^yond. the scope of. this paper to more • 
ful 3 .y consider the relationships between the Sl'Iir ajsd ’the statement • 



logic. 
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moML FimcnoRAL stimuIiI 



So stiliiull ♦ have been treated as unitaiy eleaifacits* Their- nwltidi- 
nenBlonal nature has been considered only incidentalOy* In .statins the prin* 
Qiple, “If Iwge, then color, “ for exanple> reference is made to both the siae 

and color cues of the. eliciting stlimilus; 

In the S-R language, a distinction is often nade between nominal and 

» • • t ■ . • 

functional stJjamli, the nominal stimulus ref erring, to that physically present 

' ' " 

aM the functibiml stli^us.to that aspect of the stimulus which determines 

the overt or- Implicit response. 

> • * ' • 

In the SFL, in which the principle becomes the fundamental unit, not only 

is .the norjdnalrfur»tiohal distinctly Ij^. ^ctional stimuli are 

' ‘ "»• , » ' 

» » • ” ^ \ » 

• J ‘ ^ I -kI - 

classified as to role. A. tmitimai stimulus may; serve cue a principle 

(e-.g. large) or a response (e.^. color)# The former is denoted a “prliwiple 

identifying cue” and the latter a “i^sponse determining attribute# 

•A principle identifying cue(s) Is cdnay to t|^ stimulus of each 

. instance of the principle# A response deternihing attribut# or coaiblytiy 
* ’ .*’>.*«• * *. 
of attributes varies 1**1 with the response ^Msbers when . all other del^iinlning 

K » » .S ^ 

attributes remain constant# Consider a princij^e* .reprefented by a statement 

of the fom, “If a, then B and C” where A ref ew* to the ecssaon plQ'Sical stim- 

• . * *, * 
ulus property and B and C refer to dimensions (i#e. classes of physical stlm» 






^^e S-R language Considers only iresponse determining cues and, in fact* 
there is no apparent need in the S-R lan^ge to make the above distinction icx- 
• c®pt possibly in discrimination learning;. Prliwiple and response iicntl^ing 
cues are ide atical in simple association and concept learning types. Wi^ in- 
spect to the pair, {*, Rug], ^ probably serves both. funcif ions. In conce^ learn- 
ing Involving multi-dimensional stimuli, the functional stimulus \e.g# red ; 
alrio serves to both cue the principle and the response. 
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uius characteristics). Each of the principle related stimuli have the prop- 
erty a and one value of the attribu*.es B and C. The responses_, whatever they 
ere, vary with B and C. If the value of B (c) is held constant, then the re- 
sponsea depend uniquely on C (3). With respect to the principle, **If the 
stimulus is large, then the response is determined by the color and shape. *' 
a corresponds to large. B to color, and C to shape . 

The responses may be merely the names of the color and shape properties 
or they may depend on these attributes in a more subtle fashion. For example, 
the principle might have' beem, 

"If a series is arithmetic,^ then t*.. response is determined uniquely by 
(1) the first term in the series (A), the last (L), end the number of terms 

(N) according to the formula ” 

The stimulus attributes (A,Ii,N) again determine the responses, but they exert 
this control via an algebraic rule rather than by "naming" colors and shapes. 

Notice that verbally statiiag principle (l) and having S learn to repeat 
the definition verbally would not guarantee tiiat S could use the principle. 

Being able to identify instances of a principle and to determine and combine 
the response identifying values are prerequisite . to using (i.e. "understand- 
ing") the principle. This undoubtedly is what- people like Bruner (e.g., 1961^^) 
have in mind when they speak of. the learner acquiring e sort of verbal . 
glihness without true understanding. 

It is also worth noting that the generality of a principle depends on the 
naniber of stimulus dimensions that vary with the responses (i.e. the liumber of 

■■■ ■■jim.—ljbj.iwm iwrTMii»j -n mini ii» mrr nwr~'wii~nri~T— n— — 1 — 

^In this case, a common difference between terms wou3.d be the principle 
identifying cue. 
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response identifying attributes) and the number of principle identifying cues. 
Th\is^ the principles, "If large and black , then (the response varies with) 
shape” and "If large and white , then shtpe, ” may be considered special cades 
of the principle, “If large, then (the response varies with) color and shape..” 
The latter, raore general, principle at once has fewer identiv'Vlng cues and more 
response determining d^jaenaions than the other two, more speci-fic, .principle.«<. 

In effect, it appears that critical response dimensions ore traded off with 
critical principle identifying cues. The more general the principle, the more 
stimulus attributes vary with the responses; the more specific the principle# 
the more stimulus properties • are required to identify the principle. The total 
number of critical properties remains constant.^ 

An example fi-om thd "real world" ' Indicates the non arbitrary nature of 
this iayarient. Consider the principle, "If an arithmetic series has «, common 
difference of 2, then the sum is given by {A + N - l)N. The principle iden- 
tifying cue (within a population of arithmetic series) is that of a cocaim 

€ 

difference of two between adjacent terms* The responses vary with A and K 
(where A is the first term of the series and N Is the mmiber of terms). 

The stimuli T'd.this the scope of this principle are all arithmetic serial 
having d common difference of two. They differ ae to the first term of tha 
series and the mi mber of terms* A specisl case of this principle might involve# 

for exaaq)le, only those series beginning -with the puafiber one (an addi- 
tional identifying cue). Here, the sums (i.e* responses) would vary exclip- 
aively with the number of terms, as given by the formula (1-+N-I)lf*!l2. 

Other special cases can be similarly derived. Tha essential point, again, in 

— ■ ■■ ^ ^ 1 i r riTnrnnr<winn ii—— i wfci«i in »ii !» ■ ■ ■ ■ I— ii i i K i w M ■■■■iiai iMiiiiii m nueiiwieLW— ummi mu— -Mjuu-uju-^^j^jijnwi i n ii ■ 

have said notliing about the need for principles of the fom, "if A# 
tJ\en B or C." I am of the opinion that sudh probabaliotlc respanse determination 
Is laore apparent than reaiV resiats, prlmarilyi from our inability to identify 
those ctimLus pxoperties, i) and D* , distinguishing the principles, "If A. and J)> 
then B," and, "If A and D‘, then C.” . . 






the constant mmber of arltl^al stl\m2Xus properties.^ 



^IThis invarlence can be stated and eaitily proved as a theorem, tlrst^ 
some definitions are required: 

(1) A stimulus cue is said to help identify a principle if the cue is cosmon 
to all stimuli within thr 'iiriaciple domain^ 

(2) A stimulus dimensio combination of dimensions (idien dimeMions are 
conrelated) is said to ... .p identify a response if the values of this dimen- 
sion vary 1-1 with the responses when all other stimulus dimensions vemain 



fixed. 



(3) P is a partition of a principle^ f^ if P 



n 






iHi f i ^ f 



a , «"i7 5 ) 

if 1 ^ 



where 



In vorda^ a set of principles, P, partitions another principle, f, it an 3 -R 
pair is an instance of f if and only if it is -m instance of one of the 
principles (f.,i«l,...,n) in P and no S-R pair is an instance of aore than one 



of these principles, 

( 4 ) A partition P is said to be nonredundont if for each l»l,«...n, a con- 



stant number of the response identifying dimensions of f remain constant over 
fj. If only irrelevant dlrensions of f are held conatant, the jartition la 

said to he redundant. 

The theorem and its proof follow directly from these definitions. • 

Theorem . Given a principle defined over a set of stimuli, then any no 2 iredun- 

dant partition of f preserves the zrumber critical stimulus properties. 

» * 

Broof; Sy definition of a nonredundant partition, at least one of the response 
idenvijiying dliaenslons remains coi:jietant. . Suppose k dimensions remain constant* 
Then, by definitioiu, there are k more principle identifying cues for each prin- 
ci]^e in the partition. Therefore, the number of critical cues is inivarient. 

Actually, a complete formulation of thi.a problem requires that a precise 
definition be given to the notion of a dimension (c.f« Beetle, 1961). This is 
beyond the scope of the present paper. 
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KlVOtODCSB AS8SSa«Elir . 

i. learned . Cco«lt»ettt to e stege-apj^ch to research o.. c«ple* Xearh- 
tag and teaching necessitates attention to toowiedge assessn«tU Suppose, for 
exMgie, that stage-one c<meists of learning the list shown ia figure 3- ®*se 
pair, could be learned as four distinct single-pair principUs or aa instances 



A 


^ Blae}c 


® ■ » Sb«U 


A 


^ IJhite 


O' 


Fig. 3. 


Four S,R pairs 


inwlvlrig two principles. 



of two or »ore general principles, "If triangle, then color" end’ "If circle, 
than Bite." Such differences in stage-one learning could critical!} affect 

Itamlns or perfonaarce during ffbage-two. 

In this section, the concern is with- a Methodology for asreseing Uta&Jft . 
iggnsi. The general approach is an outgro»th of aooa of onr .^lier reaea^xd,. 
Oreeno and Scandura (1965) found that in a verbal .co»«ept leainiffll situation, 

S either gives the correct response the first time he sees a transfer stimulus 

or the transfer item is learned as its control* 

The present author later reasoned that if transfer obtain* on the first 

trial, if at aU, then responses to additional transfer itens, under certain 

conditions should be contingent on the response ^van to the first transfer 

stlmauB. In effect, a first transfer sttoulu? could serve as 
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detes'iaxtte wlia": had bssn l5a2*n<'id durhij joatie'^caej. thereby jc^iJclng it possible 
'CD p'"ed'ict- 'Viiat -cespDnse 3 'tfO'.Jdd glVi »o -'i seiDOHu 'vriHiififcT /jXfXiiiUlus. The 
-••fsuj.is O'': i.\ pj..L-5t vfAcly ‘^'ece In those cases vhers transfer po-. 

tsntl-il 'ir. 2'5 in-llc&tadj tli:. re.^poriSitr :: tm '2 seooad set of stiar-uli conforasd 
to predict.loi'i in 48 i-f ir-H cases, Xn tici.s b lya?? felJ-cved to nabe a re** 

jsponse vhicb indlcat jd that ho r-slat.:. ;r5?h:lp ws noted bet‘f?een the leam'lng 
^.vd test mterials, Viitho'^t thie cont:rol . coiract responses' tc a transfer 
Sviaciulvis voud.d have occurred by chs.nca In about 3 out of 4 cases, 

A eittdlar uiethodoloKr was used in the cusa of principle leaiming -- a?scl 

• ¥ 

the results were id<=Jitical, tfuenever the Bs r^iSpoaded to one test eti»ulUB 
in. accordance vitb a i>rixieiple, the^f lu.so responded, according to the prin- 
ciple^, on 8ubtv,';qucjit stltmiii, 

te .ii3.u?strc,t:-.Yu purposes, again consld^u* the palxT,- sho'wn in Figure 3. 
c-lN,er tli 2 pi'^rs are learned, for the of the principle, 

'*If 'then olor., '* scl^w coi-iaiai; of ‘presenting the stiimilue,^^* 

response -youlxl indicate t:!iBt the p.;.itt‘!iipl« m\i operating. The 

aivticipcted re'spon'^e , 'Jv’ouid be ’^black..’' .If the stishili were outs-ide 
the icri.nciple' doicnaih or, indeed. If the principle^ bad net -been ••acquired, no 
such ;p'iredictio:a coulxl be r^ade, 

•iVjere t-^-o n^vjor questions 'that can. he asked of the aa»«QS!aent method- 
ology <I' 2 i 3 cr J.bed abovut. Cinder w^bat condititons can type cf learning be velidly 
3 .( 3 ss&£>ed arc und-^.r condit'ions does r^isponss consistency obtain? The 
utility of the. 'c-.os?sfci!i;€.-rt pr;)c-sdm-'D ’J.t:' largely dv“i')er.deitt on the ahili.ty tc 
contJ.'o3. sv.ch factors. 
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In order “to sinsver the first (juestiou, the effective variables and an 
acceptable criterion need to be identified. One such. «et -of variable* con- • 

cema the instructions given before the original leaftttng •ad. be^en the 

* • • • • . , • # 

original learning and the first teat. A feasible criterion, against vhich to . 
conpoK learning type, defined In teiaa of a. teat reaponae, la S' a verbal 

t # “ • 

port as to how he learned the items and/or as to the basis for bit test responies^ * 

• The predictive value of the laethodologjr, deScri^ is dependent on know- 

'* ■* • 

ing when S will employ the same responding Consistent of resp<Kise to . 

the test®Bti®4i may be influenced by feedback as wen as instruction varla- 






bles operating between the first cind second test responses. The effects o^ 
positive^ ne^tive, and neutral rainforcepaent of the. first test response may 
he cruc;?^. Telling 6 how he should resp^ or. indicating ttot "rules 
have changed^ " by liint or choice of test sitpxdli, may also affect response 
consistency*^ On the other haod|. .suggesting that the first response is appro- 
priate xds^ encourage use of the same responding, set on the second test* In 
our pilot experlwsRts, S .was told that he was correct regardless of how he re« 
Sponded to the test stimuli. Be' al^ was.ejicouraged to respond on the basis 

* • * 4 ' 

of hi# prior iMir^ng. In cffwrt, Jhe MRirlasnlia was dealffied to 

‘ I V ■ « 

control «ad capltallw,- an EUi BtoUn H g .for aeeeaiip^t snd predictive pur- 
poses* : Under these cpnditioiui/^^^^w oa both prlncliae 









' y, * 



a^ concept learning^ .was near perfect. 

Contrast the varlablas described hese with those typically m^lpu- 



■p*# 



-r 



^In the study ty Scandu^i'a^ ^foodvard, •pd many of .tj^se junior,- . 

hl^ 'schoca Ss who were presented with the . rple ^ 

the inapplicability pf the rule to 

in using tlie rule taught, whereas .ttbs' other €5rp^s ^re -hlg^y 

reason for this difference was atirlbut^/ly to prtpr le^ln^ . 

which indicated to the'Sa that tferfules;. 4i4^p; Hiii -0^^ ' ’■ * 
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Iftted In psychological settings. The latter variables, to which we refer, are 
those which might affect the probabilit y of priJiciple learning. In the situa- 
tions deecribed, such variables become boundary conditions to be altered so 
that the desired proportion of Ss attisin a particulAr principle. The assfc^B- 
wect irethcdology described v-ab aololy to dete«alae "what is learned" 

and when that leamir*g Affects perfonsance. 

The assessaent pioblmn, of cc/vnrge, ifi not al^S as si»ple as has been - 
depicted. It has already been polntccV out thiit dl^a«ait principles frequently 
'hove instaijjjss in ccrsBion (e<,g« ordered and ^cr3appiag i^nciples).. In order 
to detemlne which ojne of two (or sasre) prJ.nclples are operating, it is essen- 
tial that the test .pair beloiag only to the i^lnclple in question. 

‘ With actual subject x&atters, additioael. factors are involved. In the first 
place, it is not aO-ways oaity to specify uniquely, jil^e basis for an overt response... 
There is effen aore. than one path to the goal. As an illustration, consider a 
eituatW in' which. S is asked to + 3?- 551 as rapidly as pcss- 

iblOo g cira laboric?uf32y Endtiply 3.5 times kk9 and 35 times 551 and then add the 
prodUjfctS'Cjr he can recogniaie this a*5 axi instance where the distributive princl-; . 
plM would allow him to compute 35(^9^55l)”35'10^35,OQO. Clwly, it is • 

the response alone .which detexiainee what .is leanksd (i.e. the "way" in. which 
•jW/ja-ohles! is 8oHr.»d),. bat-thc^tteie It •ta&es 'Sw ' If tbs' ccrrect aammr s. 

'is glv^ in B stori; tiiw, the ^stilbutiv® jpf ineiple' was prohobly u»e4. Giving 
th® correct answer in a re3.ativcly long time would likely indicate the usual 
' CQss®^ 5 tatlox 3 fil rule. If S gives .an Incorrect answer, qr .if the problm Is so ... .. '• 
t?taay that thei'e would be little time dif f erentia’I' how B d^'.the • ' 

..1 .* . I ^ ^ * 






' '** ^ > 4 ” r , ^ -Jf •» -I • * \ 






^Such a situfttloa tsccqrs is. the l/niversi^ Schc^l ^theaatlce 

•Profpram (jack Eacoly, pe’rso?4BX co^ato ' 
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ccppittatlons, Jftjrther con®llcatAon8 are intrcducedo 

In order to better epecliy the eource of an cbeervable behavior, the care- 
,ful selection of test stlnull and lesponses Is essential. Ideally, these ele- 
Should not only indicate the liltellhopd <it the particular. pespi>eaiB« ret , 

• In qpieatlon, but . should ellinlnate edit other aXtematlvea, Although probably not 

.. .* '••• ^ , 

atialnible^ this Ideal can be approached in »any cafes'. 

Another problem Involved in work viUi actual subject matters Is that of coi^ 
plex ltyr More than one principle may, and usually dees, enter into a sin^e teirt 
response. 'To determine the learning underlying the response, it is onen neces-'* 
lary to individually assess for each principle. In other cases, it may be suf- 
ficient to simply test for the acquisition of the conjunction of principles;, as- . 
pects of the compound response often provide information about the elementary 
principles. For example, consider a gao® i** which the two players alternately 
select numbers frea a specified set of consecutive integers (including l) and* 

• keep a running sum, the winner being the one who picks the lest number, in a ser- 
ies with a predetermined sum. If this' sum is 31 nnd the set consists of the in- 
tegers 1-6, the players select numbers from 1-6 until the cumulative sim Is 

either 31 or above (in which. case no. one wins). .There Is a compovind rule which 

• * . ^ * ’ *' • ’ * , 

" allows the player, who goes first, to win any such game, ”W.vide one more than 
the largest nuuober in. the set into the desired sum • make the remainder the ^rst. 

- on subsequent trieii^jCOMistently se^et that. which when added to 

the qppcfOenVB pre.ccding cd«)^lce:cua» to, one more th^ the nunher in the 

There are actually tvb principles involved, involving selection .of 

• •• •* .*.• ."•»** 

the first nuBd>er and the other iayolvlng., sequence, choices. . To win consistently, 

.’ 4 ; ' 

a * a 

both principles may be employed,, one after tte otjtere The acquisition of each ■ 
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principl.e can be determined by analyzing distinct aspects of the Dve"all game plan.^* 
In many test situations, there are few available responses from which to 
choose (as in True-5^1se and J^ultiple Choice tests). Under these conditions, 
there are additional problems of assessment since there is a high probability 
of giving any particular response (by guessing) irrespective of learning type, 

A similar problem obtains in assessing concept learning. There are at least two 
ways of overcoming this problem; (l) present more than one test stimulus and (S) 
include appropriate controls (e.g. , Greeno and Scandura, (1965)» 

How to learn . - In addition to assessing what is learned, the problem of assess- 
ing learning process may also be considered. Although the problem is consider- 
ably iziore complex, the methodology involved is but a simple extension of that 
used to assess learning type. In addition to presenting to-be-leamed materiEj. 
and determining by assessment, what is learned, a second set of materials must 
be presented. Assessing what is learned, in the second case, would then provide 
a basis for comparing the manner in which the original and test (second) displays 
are encoded. This paradigm can be represented by; 

Learn A, present A test stimulus, learn B, present B test stimulus whereas 
the original assessment paradigm is represented: 

Learn A, present A test stimulus one, (present A test stimulus 2)., 

More is said about this problem in the next section on applications. 

^Thls problem has much in common with diagnostic work. There are also strong 
similarities with Gagne’s (19^2) approach to task analysis. Although it is beyond 
the scope of the present paper to show chat it is so, there is strong reason to 
believe that the SFL can be used to make these procedures more explicit. Such 
precision may be a necessaLry adjunct to more sophisticated diagnostic technolo- 
gies. 
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APPLICmOMS 

In this section, a variety of persisting problems ere formulated in SFL 
versus. In most cases, the. assessment methodology also plays an is^rteht role, 
Ihroblenus involving paired associate (PA) principle learning, expository emd 
discovery modes of instuction and cognitive developoDient are considered 

•a.* 

Principle Learning . » The question of relationships between S-R pairs seems so 
obvious, and so basic, that one wonders why it has not been studied exterusively. 
Because it provides a simple 'context- in which to contrast mediation and set* 
function formulations, the problem is described in some detail. 

Consider a PA context in which the relationships between four pairs are 
varied while the other factors ore held constant. In Figure 4, such a manipu* 
'lation is accomplished by selecting two principles, **If the stimulus is black, 

% 

Insert Figure 4 about here. 



then the rdsportSe’ is the nsi&e «f its shape” and “If whfte, then size,” The exper* 



^tlther areas of potential application include human performance (e.g. Posner, 
1S>64) and task analysis (e.g. Gagne', 1962 ), ■ 

Responding to the stimuli in Figure 4 would constitute an information conser- 
vation task according to Posner’s Cl964) classification scheme since there are 
four stimuli and four responses. According to a previous discussion, such perfor- 
mance could, however, be based on the acquisition of four one-instance principles 
or two two-instance principles. In the latter case, the task could conceivably 
act more like one of infom^tion reductio n (Posnei*, .19^4). In effect, performance 
(latency) coul.d depend on the learning 'anderlylng the performance. It is beyond 
the scope of this paper to go further into this area, biit the potential implica- 
tions are clear. 

The task analysis procedure of breaking down skills into components seems to 
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inental stjbmili, corresponding to these principles, are ail either black or 
white and have scene well-known (by the Ss) shape and size. The responses are 
Blifliply the shape and size muses. Xn the es^rlnehtal list, two jpalrs correspond 
to each of the two principles^ Hote that the control responses and stimulus prop- 
erties (l.e. size, color, shape) ora identical with those In the experimental 
list. In addition, this responses, In both, eases, are names of one of the prop- 
erties of the corresponding stimuli. Any differences in the leeumablllty of these 
lists would be hard to attribute to anything but the presence of relationships iM- 
tween pairs in the experimental list.^ 

The mediation description of the list contingencies, even as modified, 
leasrsB* mich to be desired% The representation of principle learning is rela- 
tively complex and would have been even more so had we not let the re represent 
the typically made dlfitinbtlon between the mediating responses and thdir as^n»ed 
stimulus properties. A more cniclal limitation is that the chain diagram sluipily , 
would not make clear. why Is the overt riMiponse yather than 1^, without 

the addition of the more, direct two-stage, chains ^Involving the rs^ (i«l,*..,4). 

As indicated previously, this 1-1 pairing does not follow from an analysis of the 



" be epite analogous to determining those learnings pirare^islte to a piutlcular 
porlnciple. The reader Is referreil to Qagn^’s stings (e.g. 1962,1964, 19^^) for 
a more detailed account of these Ideas. 

^It may appear that an appropriate control list, could be constructed by 
pairing the seme stimuli and refuses in ^random fashion. Alas, this turns out 
not to be a critical control. Ary differences between the groups can be attrib- 
uted to pre-experimental associations between stimulus properties (e.g. shape) 
and the corresponding responses (e.g., shape *'nsmes^) rather than to relationships 
between pairs. These considerations dictate that prior learning be either absent 
or equivalent In. both groups. The SFL requires that none of the control pairs 
have any (obvious) relationships (i.e. no principle includes more than one pair). . 
The experimental- lists shovm in Fig. 4 meet these requirements. 
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S-K links in the chain. 

In view of the increasing difficulties implied in dealing with still more 
complex problems, we cannot help but recall pre«Copemican epicycles and re** 
lated attempts to salvage geocentric theory. How much simpler when the facts 

i 

are expressed in the S5L. 

Judith Anderson and I conducted a pilot study that is relevant. Its pur- 
pose was to deteimine: (l) the effects of the number of instances per principle 
on the probability of principle learning and on the rate of learning and (2) 
relationships between principle learning and leai*ning rate. 

The materials to be learned consisted of j.ists of 12 pairs similar, in 
type, to those shown in Figure 5 . ’ Bach stimulus had a property relating to 
shape, borders, shading, outline, and color. Four colors and eight values 
. of the other four attributes were used to make a total of 32 stimuli. The 



BBSFORSE.. • 
dashed 



Fig, 5. This sample pair is ciiuular, shaded with crosshatches, outlined in 
dashes, has two borders, and is black! 



renponseB were labels attached to one of the non-color stimulus properties. • 

^ i 

Of the 12 pairs, in each list, six were instances of one principle (l6), three 
were instances of another (P3), two were insteuases of a third (F2), euid one 
was an instance of a fourth (Pi). In each case, the principle identifying 
cue was a color and the response deteralning class was either shape, borders, 

o 
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shading or outline. The four identifying cues and determining classes were 
randomly paired to form four principles (e.gv if black, then shape), which 
appeared equally often \mder each condition. 

The PA list was learned by the anticipation method to a criterion of 
three consecutive errorless trials. To determine .whether the principles had 
been acquired while learning the pairs, each S was sho\/n two additional lists 
of four stimuli each. In both lists, one stimulus was associated with each 
principle and each stimulus appeared only once, Hespondirig accoz^iing to the 
principle was presumed to indicate that the principle had been learned. 

Prior to learning the original list, each of the 20 college Ss was 
trained so that he was familiar with the stimulus dimensions and could name 
each stimulus property. These responses were typed on a card and were always 
available to S, In addition, 8 was told that a pattern was involved which 

might facilitate his learning and. guide his respoises to the transfer stimuli. 

« * 

The dependent variables were the average number of errors per Instance 
for each § (on each of the four principles) and the number of appropriate re* 
sponses to the test stimuli (two for each principle). 

•# 

Except for a very small reversal between tzeatments P3 and P2, the aver- 
age nufuber of errors per instance decreased with the mniber of instances per 
principle: 3.4, 3.5> and 2.7^ respectively df*3/7^, P^.05). . 

The difference between Fi sad P2 wm significant df«l/ 76 , p<^.05) 

but none of the other od^Jacent means differed significantly. A|iparently, the 

rate of learning increased sharply with the addition of a second instance 

* • # 

and then tapered off. 

The number of appropriate responses to the twsfer stimuli was also 
affected by the nuniber of instances per principle. There were 27, 8, 15, and 



o 
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9 appropriate respo&ses given to P6^ P3, P2, and PI transfer stimuli. A1- 
thou^ the trend vas not entirely regular, a sign test indicated that the de- 
gree of principle learning was higher in treatment. P6 than in the average of 
treatments P3, P2, and PI ( E ®2.6, p^.005).^ 

• ' I 

Another analysis demonstrated that 3,earnihg rate was related to P6 prin- 
ciple learning. Of those 9 3s who responded appropriately to both P6 test 
stimuli,^ 7 had below median {2,6l) error acores, indicating more rapid 
learning; of those 11 Ss who responded appropriately to at most one. test 
stimulus, 8 had above median error scores. An. exact test (Finney, 19^) 
indicated a significant relationshipbetween principle learning and learning 
• rate (p^.035).^ 



^It might be argued that the difference in the nmnber of appropriate re- 
sponses was due to there being more responses per category (e.g. shape) in 
treatment P6. %en in doubt, the Ss may have tended to give a response from 
the most fre.quently experienced category. A coinparison of the average number 
of V6 responses given to the P3, P2, and EL transfer stimuli (l6) was not 
significantly higher than the ten P3, and EL responses given to the P6 
stimuli (p^.lO), 

^It should be noted that the probability of giyihg two appropriate re- 
sponses in a row by chance-guessing is one out of l6.- TUnla fact precluded 
the possibility of obtaining signif leant relationships with respect to the 
other principles. Only 3, 5, and 2 Ss gave both desired responses to the 
P3, P2, and PI test stimuli, respectively. 

^It is interesting to note that Erickson (I 963 ), in studying the von 
Bestorff effect in a paj.red associate task, also obtained results which were 
difficult to explain in tenns of relationships between stimuli and responses 
(l.e. stimulus and response discrimination) rather than between pairs. 
Erickson (I963) found that an S-B pair which differed from the others In the 
list only in terms of the relationship between the stimluc and the response 
was learned faster than the other pairs. 
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Exposition and Disccrvery .^ Previous studies involving expository and dis- 
covery modes of instruction have not teen entirely consistent, even when 
the studies apparently have been well controlled (e.g. Craig, I956; Gagne 
and Brown, I96I; Haselrud and Meyers, 1958; Kersh, I958; Wittrock, I963). 
Part of the difficulty has been due to a lack of consistent terminology 
(e.g. see Wittrock, I965), but other problems are not so easily disposed 
of. Thus, Wittrock (1963) found that rule given groups perfonned better 
on a transfer test than did a discovery group. In . the Gagne and Broim. 
(1961} study, however, the results were reversed. 

In explaining the results of such experiments, recourse is frequently 
made to 'Vhat is learned*' (e.g. Gagne and &*own, I96I; .Scandura, 19©f-; 
Wittrock, 1963) . Unfortunately, such explcuiations are not only post hoc 
hut are without objective evidence independent of the experimental re- 
sults to be explained. A isore plausible alternative may be to make ex*, 
plicit an a priori distinction between logical and behavioral factors. 

Consider first some of the experimental factors involved in learning 
by es^sition. Frequently, a rule is presented directly and the Ss are 
tested to see if they can give appropriate responses to stimulus exem- 
plars of the rule. The scope or generality of the rule, however, is 
rarely made e^qplicit. This has not often caused real difficulty since 
scope normally is determined rather directly. Still it is important to 
note that scope is a logical vwiable dependent entirely on relationships 
between the rule and. its instanc.es. There are no behavioral questions 
involved. 
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The logical factors involved in discovery treatwents are more subtle j 
yet, they are at least as important. ‘Reaching by discovery xiaually iwrolvee 
presenting cwie or more instances and/or test stimuli and/or cues, hints, iuid 
other indirect guidance as to how S sh^ild ’‘process” information.. In one 
form of discovery, S is presented, in^tuTti, with several instances of a 
principle and then is ashed- to give the correct response to the stimulus 

0 I * 

muOfr of a new instance, -tn effect, the learner is required.tp abrtract 
a principle, much m he woiad in concept leamii^. ft another form of dis- 
covery, the learner Is shown csOy stiwttli and is given direction as to how 
he night detemine the correct reirp«nse. In both cases, the "discovery” 



of principles also necessarily involves leainiag ^ ^ acquire principles. 



The generality of the cues and hints given vbuft detersiiiw the range of . .. 

' applicability of the processing sKsde and the number and nature of such 
cues and hints would deterttiae the likelihood of . acqateing such a ao^ 

• « k - ^ ^ M M “ 

At one extreme, this could involve learning to acquire a siaple principle 
with no direction and, at the other extreas, liwolve being told a higher 



order principle for determining a broad clads of subcnrdinate prlnciplea. 







Again, logical factors appear to be as crucial as behavior vwiables in „ 

determining outcomes. ■ ' ‘-'v 

Unfortunately, there has beon littie ^ttei^ ii studies.on eaposlti^ 
and disco'jreryi», to predet^misve t««t st AiWfl , ! ;tnd 

sponses and presented Infonsstion. The. fo34y .of apt^d®^^ so beooaes 
creasingly apparent as asoz'e and more studies, Involving conplex aaterwJj^',,'/ 



demonstrate that structural factors can* be usually are) siore isqpoiv: 
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tant than traditional behavior vei‘ tables (Gagne, 1962ji Scandura, 1965b j 
Scandura anl Behr, 1965; Scandura, . Wooaaard, ^d I,ee, 1965).^ 

Determining relationships between presented and test materials is a 
logical problem. What S does with the information (i.e. test performance), 
however, is a behavioral question arxd depends bn a variety of factors, not 
the least of which is the ease with whicVt the presented material can be 
learned (Scandura, Woodmrd, and Lee, 1965)* In the stvdy described earl- 
ier (Scandura et al, 1965), predicting perfomonce was largely a structural 
problem contingent primarily on leamability. 

Nonetheless, most researchers (e.g. Craig, 1956; Haselnid and I'feyera, 
I95S; Kersh, 1953; Kittle, 195?, Wittrock, I965) have failed to distin- 
guish between variables affecting what learning can obtain and variables 
affecting whether learning does obtain. Treatments have differed both in 
their logical relationship to the test materials and in ease of learning; 
This has not affected the validity of the results, but it has nade them 
difficult to interpret. It is impossible to present really definitive 
ancdyses of most studies in this area since there has been no systematic • 
attempt to pre-experimentallif* specify structural factors. ?c be. defini- 
tive, criterion measures, as well as treatments’, need to be chosen care- 
fully for specific purposes. 

In the Wittrock stiidy, (1963), for example, rule ( given- xiot given) 
and answer (given- not given) wrere independently manipu3.ated . The rule 



I ^ ^Logical analysis has. probably not been' so crucial in traditional 

laboratory studies since, in most cases, relationships between presented 
and test materials are quite direct. 

# 
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and answer given grcup (RA) effectively was presented with two rules — 
one specific to one problem and the Ouher of a more general nature. V^bich 
rule (if not both) was learned by a majority of^Ss would depend on their 
relative ease of learning.^ The rule (R) and answer (A) given groups had 
no such cnolce. Group R was shown the general rule and a 3timJil>us on wlVich 
to apply it. The A Ss were shown an instance (S~R pair) of the rule and 
were given directions to. discover it. 

Rule learning may have been harder than answer learning, but only 
rule learning made it. possible to perform successflilly on the learning 
test which consisted of a new instance within the scope of the rule. 

The no rule-no answer (discovery) group was required to discover 
the rules independently as Mtere the A So- The .A Ss, however, had the 
advantage of seeing an instarice of the rule. The performance of these 
groups on the learning test gave some indication of the difficulty of 
the task with which they were confronted. Th-it the HA Ss performed 
SOTvewhat more poorly than did the R Ss (p=. Of) suggests that having an* 
easy to learn answer svailable may have detracted from learning the rule. 

Al'ter three weeks the rule- given Ss were better able to solve new 
problems based on new rules than were the discovery Ss. How was this 
possible? Unfortunately, it is impossi.ble to tell for sure. An intenfiliBS 
analysis of the ten rules actually used would be required. All that 
can be said with certainty is that the discovery Se found it harder to 

^In the study by Scandura, VJoodward, and Lee (3.965)/ the ’’answer 
given” groups learned be.st. 
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learn the rules than did the rule- given Sc- The discovery Ss were forced 
to discover rules ihde^ndently, those who were successful perhaps also 
acquiring skills appropriate for discCverlng new rules. ^ The rule given 
Ss were merely required to learn rules. The much higher level of learn- 
ing in the njile-given groups may have leore than compensated^ So far as 
novel transfer was concerned, for not having practice in discovery avail- 
able to them. Because of the complexities involved> the novel transfer 

t 

test, in effect, served only as an exploratory probe, for this measure 
to be definitive — that is, to demonstrate that giving rules is better 
than discovering rules — it would be necessary to equate rule learning. 

A study by Gagne' and Brown (1961), in fact, does indicate that . self, 
discovery may increase ability to learn when original learning, is equiva- 
lent and involves single- instance principle 3 .'(i.e. simple associations). 

In that study, the Ss were presented with number series, such as 1 , 2 , 

4 , 8, 16, 52,..., and were either given or required to find rules (i.e. 
formulas), depending on the number of terms, for finding the ra of series 
beginning with the specified terms and continuing with the sane pattern. 
After conpleting a preliminary program designed to acquaint the Ss with 
the caicept of a number series and a number of terms reXating to such 
series, three treatments were given. Then, after exposure to a series, 
one group (RE) was told the correct rule and presented with exanq>les, 

infcimtlon given in the published description of the study makes 

it impossible to determine whether sucJi an information processing skill 
coiad have obtained. • » 
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another group (GD) was given directions as to how to ^Ind such a rule, and • 
a third group (D) was told to find a rule and Vas ^ven sco» hints as to . > 

* ' I 

• ^ X 

what the rule sdfi^t be# Since the error rate on all prograsis was 
ably low^ it ni^^lit be assuned that the tV ee groa)pe learned .tlae desired 
rules to about the saaie degree. That is, after completing one of the, pro- ./ . * 
grains each S was probably able to state the correct iuXe when shown each 



of the series taught. The learning could have involved single associations 

between specific series and formlas, but, from de^scrlption of the pro- 

* ' # - * ■ « »' ■ ' * 

grams given, more likely was of a ccnceptual type involving series types. ^ 
The GD group, and to ‘a somewhat lesser'' estent the D group, had an oppor- 
tunity to discover a stianiius proceasing hulej a r^e which xsay.have 

« * » j* * ^ 

*• , ■ t ^ ^ ^ 

told then how to go about detern4ning A f^or finding the sum of 

a series shirm for the first time^ . The results indicate clj«arly that 
when original learning, in this case four series-formula pairs, is 
equated, discovery enhances the ability io learn; Although it was hot 
possible for Cagne and Brown (19^1) to -S|!ieclfy exact^. i^hat enhanced ' 

this ability, the present discussion suggests- that -the answer will be 

* \ * * * • . • ; # 

based on d logical, rather than behaviwgl, apaaysis. Once ..specified, C 

of course. It be ppssj^= "iiscovery” 

potential by stating a geneial 'p^ 

• ^ ‘ ■ • . • * * * 

Gdgne and Drown {I961) :^teipreted the treat»»nt efifhots os being 

• . . ^ ‘ ^ ‘ , * * 

due to differences in what hsd^been leaiim^* v ^ia was a legitimate th^ 
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to do. beaming how to find a formuia ^Ufferent from learning 
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to use formulas. The difficulty, I think, it one of j.ossihle confusion. 

The formulas, themselves, uere rules and such nilca have formed the basis- ‘ 
for most other studies ccnparing - expository and discw<ary modes of instru^ 
tion (e.g. Craig, 19^; liaselrud and Meyers, 195d; leri^h, 199&; W5.ttrock, 

1963 ). 

Perhaps one of the foremost arguments in favor of formulating re- 
search problems in the 8FL is that one i$ forced to consider relation- 
ships between presented and test materials. The relaticn^iihips become 
esqplicit in terms of the stijuull, responses, and principle involvi^d 

Thus, when the principle is a decoding principle (e.g. tflttrock, I 963 ). 

♦ • 

the stimuli are enciphered sentences, atsd the responses are deeipbez'ed . 
sentences. VIhen the responses are fbraulas, as in* the Qagn^^ and Brown " 
.( 1961 ) study, the principle involved is sene unspecified* higher order 

I ^ 

rule for determining formulas for finding the sums of series. The atim- 
uH, of course, axe the series theuselveso 
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^vereal and Ifonre veraal Shifts Figure 6 chax'actftrlzes reversal ajad nonre- 
versal shifts. The stimuli sho\m all liave two^ tvo-valued dimensions. On 



First Discrimination Second Discrimination 



Sniall Positive 




^Mg. 6. Ejcaaqples of a reversal and nonreversal shift. 



the first discrimination^ size is the relevant dimension; large and 

• « ♦ 
characterise two conce]pta, each with- a distinct response, + or -• Color is 

Irrelevant, ^fter S can reliably jncdie the first discrimination, a second 

discrimination problem is presented. A reversal shift involves exchanging 

ohe two responses, large going with - and small with +♦ A nonrcversal shift 

involves reacting to the color dimension, black objects going \dth + and 



white objects going with 
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Pre-verbal children (Kendler, Kendler and Wells, 19^^) and animals 
(Kelleher, 195^) find nonreversal shifts easier to make than reversal 
shifts whereas older more verbal Ss, find reversal shifts easier (Buss, 

1956; Barrow & Friedman, 195B> Isaacs and IXmcan, 19^2; Kendler aisd 
BMbcato, 1955)* In a .study with kindergarten Sl, Kendler and Kendler 
(1959) found that fast leairners, like verbal Ss, were be'^ter able to iinake 
a reversal shift whereas slov/ learners, like preverbal Ss, were better 
able to make a.nonjSiFSCBal- shift, it was suggested that the fast Ss 
approached the experimented task with verbal label's for the correct stim- 
ulus already strongly attached, the verbal label serving as a mediating 
lii^i in a tvb-8tage S-B paradlgB. The leamii^ of the slower learning 
and presumably preverbal Ss was asBumed to Involve a cixsglewstage paradigm.* 
Khndler and Kendler (1962) explained the relative ease of reversal and non- 
reversal shifts in terms of the number of ,S-H associations that need to be 
changed. 

This interpretation, however, leaves unanswered the question of whether 
the stated results were due to a progressive improvement in all children 
* with age or to a larger proportion of faster 3,earnlng children havi.ng what * 
ever characteristic it is that oskes reversal* shifts easier.- 

Befonnulating the reversal-nonreversal problem in the Slbli provides a 
basis for answering this question. Perhaps the relative ease of shift is 
dependent on "what is learned" on the first discrimination — two concepts 
(e.g. "If large, then and "If small, then or four discrete pairs. 

If concepts arc learned, reversal shifts should be easier since this men^Iy 
involves learning two new responses, principle identifying cues (e.i?:. 



larfte and small) remain constant. A nonreversal sViift would involve learning 
either two new concepts or four new discrete pairs « On the other hand, if four 
diflcrete pairs had originally been learned, a reversal shift would invol.ve 
learning four wsw rc-isponses, whereas a nonrsversal shift would involve ;,earn- 
ing only two. 

Of course, this interpretation is anaiogouB to that presented by S-R 
theorists (e.g. Goes, 1961; Kendler and Kendler, 1962). It Is the assess- 
mfint methodology which provides the means for defceimining whether learning 
type is related to relative ease of shift. 

In order to assess what is learried on the first die crimination, it is 
necessary to employ dimensions, such as color {e.g, black., shaded, white) 
and sba^ (e-S* circle, square, triangle), which have 0102*6 than two easily 
discriminated values. This procedure makes it possible to use two values of 
each dimension on the first discrimination problem leaving the others for 
assessment purposes. Thus, for example, tlie four training stimuli might be 
either black or shaded and a circle or a square « If reinforcement is given 
accciding to ’’color/’ the. assessment procedure might involve presenting a 
new discrimination problem in which the two stimuli have the two* color attri- 
butes, used during training (e.g. black and shaded), and the shape attribute 
not 80 used (e.g. triangle). Choosing, as positive, the object having the 
ftame color as the positive training stimuli, would be indicative of concept 
leaiTiing (on the training task) were it not for the high probability (^) of 
choosing this object by chance. Assessment "certainty," of course, can be 
ii^proved by using more test discrimination tasks. This is mad.e possible by 
in^reasliig the number of values per dimension. In order to jninlmize "strat- 



shifts" during the assescraent procedure reinforceicent should be given 
at each choice i>oint no matter what the response. ' 

There is another way of reformu}.atiiig reversal and nonreversal shifts in 
SFL terms. In an ioportant sense, the problems posed ai*e different from any 
previously encountered In this paper, ^ox one thing the stiraull are pairs of 
objects. In addition, the response determining attributes nay be relation* 
ships. Learning to discrimiMte between the object pairs shonm in' Figure 6 , 
for example, can be accomplished by learning a principle of the fom, "If 
shown two objects, then choose the larger one." Such a discrimination can . 
also be acccopllshed by learning each principle in 9oae nonredundant parti- 
tion of this principle*-.- c.g. **lf shown two objects and one IS large and 
black (white), then select large black (white) object."^ 

In spite of the apparent differences involved, this formulation also 
leads to a similar analysis of this shift problem. If a single relational 
principle is learned on the first discrimination, reversal shifts should be 
easier since this merely involves learning a nay response — pick *the smaller 
one. The principle identifying dinse-nsion, different sizes, remains constant. 
A nonreversal shift would involve learning either a new general principle or 

• j * * 

at least two new subprinciples in a partition. On the other hand, if the 

• . * * 

1 ' 

It should be emphasized that S need not be aware of verbal labels in 
order to use a principle. Here, the labels serve primarily a definitional 
function. 

It is also worth noting that discrimination learning may be viewed in 
terms of relationcil principles. Although not treated here, there is reason 
to believe that transposition ihenomena (cf. Hebert &. Kfants, I965) may also 
be reformulated In terms of prin'^iples. 



J.* 

original x^artition had been learned, a reversal shift vould involve learning 
at least tvo nev subprinciples, whereas a nonreversal shift would involve 
learning only one. 

Unfortunately, there is no relevant data, Becavse the hypothesis and 
questions raised have obvious implications for studying interactions between 
developmental and learning problems, research cn this question is urged. The 
results also would provide additional information conveming the appropriate- 
ness of the SFL and the related assessment methodology „ 



CoauervatlOD. veisus WonrConr.ervatioti,- Questions 2 'eiating tc the conservation 
(l.e. invarience) of certain concepts^ suoh as* aaoimt, height, and number {e.g. 
FLavell, 19^3), ci^mpriso ^tother probleai are.a in cognitive (^evelopnent that can 
be i«forinulated in SJL terms. 

Consider a pj.'\x;ed;ire that is frequently used to deteimine whether a child 
lias icamed to conserve "amount. " E shews the child tvo balls of clay, 
both of the same o?.ze. S is asked whether ^efc ball, in tui'n, contains mor*e 
clay tbisn the other or whether they contain the same amount. Invariably the 
normal child says they are the same. 3. then rolls one of the balls into t)!ie 
shape of a sausage as shown in display two of Fitjure 7 and then asks the same 
question. If the child, consistently sj«ys that they are the seoiiis, he is said 



Display One 



Display IVo 




O 






n 



Figijre 7. IVo displays designed to determine the conservation cf amount. ‘ 
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conaen^e ann)unt. If not, he in a nonconserver, 

.iccord'.ing to M Jfelroer,^ the nonconstjrvcr apparently responds on the 
basis of length. A SHi formu^tion of thl)> problem suggests that this is 
ladee<.\ the case.. In crdai* b3 properly fonmilat^ this question, however, all 
of ths i^iportaat ccramnalities lofttveer. the two object* in each pair must be 









personal cojwaunication^ 
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identi, ied. Not only do the objecta in display one have the same ainount_, but 
they Viave the same height, len£?th, width, and shape. 

The child could respond on the basis of any one or moxe of these attri- 
butes. The word "amount," or whatever word is used by E, can have an entire- . 
ly different significance for the child. The protlem, of course, is compli- 
cated further because these dimensions are not all independent. 

For the sake of argianent, assume tliat the chiid responds according to 
either amount or length, but not both. In display one, the values of both 
dimensions are identical. Thus, a correct response could Elgnify the opera- 
tion of either of the two principles, "If shown two clay ob,jects, then the 
response depends on the relative amount (length)."^ IHsplsy two might be 
presented to determine which of these two principles is operating. If the 
child now says the sausage contains more clay, tte operating principle prob- 
ably involved length. If he says "the same," amount was lilsly the deter- 
mining factor. It is worth noting that both of these two rer^ponses also 
eliminate height frcmi consideration as a basis ."'or responding. The former 
does not, however, eliminate shape . 

One further poiJit needs to be made. In de'/elopfieriteJ. situations of the 
sort described, conservation of amount is aasuoi?! to have beer, acquired or 
not acquired prior to the experiment. If S do.=i6 not demonetra.e conservation, 
it is not a question of choice but of necessity. The priaciplt of length may 

^It also could conceivably be of interest to distinguish letween volume, 
cross-sectional area, and weight, 

%hese two principles, of course, refer to the generic basis for the 
child ^8 response and iiave nothing to do directly with the princ;;ple E intends. 
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be operating, rather than that at amount, not ^uet because S prefers It, but 
because he has not acquired the concept of amount* 

This nay not necessarily be true* The child nay have previously eicquifed 
both the concepts of ancoint and length in a generic, nonverbal, sense.. All 
that can be said with confidence is that the phrase, "which contains nore clay 
nay be intei'preted by the child in one or the. other of two ways* 
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VmIpss to rathor 'cXi-no wAtifcUij'. ;tt. da i\naicip» 2 d that mura research 
' im heXp !iKX-i«St laioy >a' the tieirivl.wiil wr»t«ri«» att^h»d 
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ciple hecoma central;i this dlatlnction 1 a not aoade in th« S-R language. 
I^rhaps the prioary reason that such a distinction^ to ny knowledge^ 
not bean made before Is that in the classical learning types studied by 
S«R experimentalists (e*g«> association^ slirp3.e chain^ concept) > the id^« 

. V • 

tifying and determining cues are identical* The fhnctional elements in 
canon between ordinal and transfer stimuli serve both to identify siml- 
laritias (and differences) and determine responses* The dichcrtaoy involved 
in principle learning provides a direct basis for separating the pi^cholog- . 
leal processes of discriminating between stimuli and of responding to a 
stiflulus. 

Viewing* behavior in terms of principles also aakei possible a feMlble 
fenmaation of the vexing problem of ’’what is learned*” Aftar assttoptions 
have been made regarding the underlying stin^us values and disMshsiona^ . 
learning can be defined In terms of obsarvable test stiouli and responsos. 

It would appear that such an assessment methodology may provide a necessary 

• ' ^ ^ 

basis for an increasingly called for multi-stw© approach to co^lax learning 

and teaching* under these conditions such research may he eiqpeetcd to make 

• ** / 

more xvqpid progress* 

2h spite of the rather far reaching basiC' assumption made in this 
paper, ho attempt has been made to even outline a theory based cn the notion 
of a principle. It is certainly to be hoped that no. major dianges need to 
be made regarding fiuch. sacred principles as contiguity and reinforcement i 
Since they have guided much of the pireced.ing devalosanent, I feel 
obligated to make certain conjectures as to the possible nature of aone 
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additional theoretical assumptions . First, the response to a new stimulus 
is determined uniquely by the operating principle. Regardless of how 
principles are acquired, .behavior, according to the' present discussion, 
depends on the selection and use of some principle. Second, principles 
are assumed to continue in operation unless disrupted by the conflicting 
influence of new input and/or feedback* According to this postulate, 
responses to new stimuli remain under the control of a particular principle 
unless either the stimuli do not correspond to i^e principle or feedback 
otherwise indicates that the rules have changed* This notion has much in 
coBinon with the mathematics modeler. view (e.g., Bower and Trabasso, 1964; 
Levine, 1964; Restle, I962) that S changes hypotheses only when given 
negative reinforcement (however, see Suppes and Schlag-Rey, 1965)* 

Clearly, these two postulates do not, in themselves, oanstitute the 
basis for any theory; existing 

principles are modifiea and new principles are acquired. They are much 
like Newton* s laws of reaction and inertia .without i© F»ma to tie them 
together with the dynamics of change. Although some guesses ctwld be 
made as to the nature of such a postulate, it would be inappropriate for 
ms to speculate further at this point. 

One final comment seems in order. Nhen carried to the extreme, it 
beconms clear that the principle, rather than the. association, ^ the 
basic behavioral unit. No two stimuli or responses are exactly the same. 
Nhen we speak of a discrete association we are, in actuality, referring 
. to a set of S,R pairs in which the physical stimuli and responses are 



almost identical. 



If the preceding analysea hejw* any ¥ci^t at all, It would appear 
tha;t the future progrese of research on cooplex leamijig wcni instruction 
will be largely dependent on the ability to deal cieailtanv^oualy with 
structural and behavioral variables in both a precise and I'inple way* The 
8fl is presented •» one alternative. 
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USING THE PRINCIPLE :T0 FORMULATE RESEARCH 



ON MEANINGFUL LEARNING* 
(A Set-Function Language) 

Joseph n. Scandura 
University of Pennsylvania 



The search for a suitable scientific language in psychology has had 
a long history. Unfortunately > as v/ith theories » there is no ^ firiojl. 
basis for deciding between alternatives. Which will prove most useful 
can be determined only after a period of use. Nonetheless » certain char- 
acteristics appear desirable. One of these is precision. The primary 
requirement, however, is that the language accurately represent the 
important characteristics of the phenomena in question. Without such 
fidelity the language can have no real value— it is the sine aua non. 

In order to construe a precise descriptive language, which adequate- 
ly reflects meaningful learning, a basic behavior unit must be selected. 

The history of science has shown that the hypothesis-generating and pre- 
dictive value of any theory or scientific language is determined in large 
part by the appropriateness of its basic building blocks. 

Many theorists have been primarily concerned with extending S-R 
formulations to account for complex phenomena (e.g., Berlyne, 1965; 

Kendler & Kendler, 1962; Haltzman, 1956; Goss, 1961; Osgood, 1953; Staats 
& Staats, 1964). Although it has been repeatedly emphasized that the S-R 
approach is simply a means of working, of baring essentials, the neo- 
associationist iaiplicitly believes that the association provides the moat 

*Thanks are due John Carroll and Robert M. Gagne for their helpful comments 
on an earlier version of this paper, Felix F. Kopstein and Donald Payne for 
their friendly but trenchant criticism, and Judith Anderson for ner general 
assistance. 
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precise and efficient unit with which to describe behavior. 

The fundamental, and perhaps most questionable, assumption underlying 
neoassoclatlonlsm Is that mediating links In an S-R chain have the same 
properties as overt S-R associations (Berlyne, 1965, 17-19). In view of 
the success achieved In viewing animal and simple human learning In terms 
of associations, parsimony would seem to call for such a principle. Yet, 
practice has shown that mediation Interpretations become Increasingly 
cumbersome ,nd less precise as situations become more complex.^ Ilore 
Important, Anderson (1964) and Fodor (1965) have recently argued convinc- 
ingly that multi-stage explanations only give the appearance of greater 
explanatory power. Single stage formulations can always be devised which 
are equivalent. 

Largely for these reasons, other theorists and highly reputable 
writers (e.g., Ausubel, 1963; Bartlett, 1930, 1953; Dienes, 1963; Gagne, 
1962, 1965; Handler, 1962, 1965; Hiller, Galanter, & Pribram, I960; Piaget, 
as described In Flavell, 1963; Polya, 1962, 1965; Newell, Shaw & Simon, 
1958) feel that the S-R language does not capture the essence of meaningful 
learning. Typically, they find the Idea of an association or network of 
associations to be Incapable of reflecting all that a human does when 
confronted with a problem situation. Constructs are needed to enable ^s to 
think (Handler, 1965, 325). Thus, Bartlett (1930, 1958) speaks of organi- 
zation and rules, Gagne (1962, 1965) of knowledge and principles (and 
learning sets), Handler (1962, 1965) of structure, Hiller, Galanter, & 
Pribram (1960) of TOTE units (and heuristics), Piaget (Flavell, 1963) of 

^The present practice of repeatedly extending assoclatlonlstlc schemas to 
account for new facts, particularly as regards meaningful learning, is 
highly reminiscent of pre-Copernlcan astronomy. At that time too, emerg- 
ing facts were Incorporated Into what we know now to be an unnecessarily 
complex (geocentric) theory. 
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s lemas, Polya (1962, 1965) and liewell, Shaw, and Simon (1958) of heuristics 
and Tolman (see Hilgard, 1956, 191) of cognitive maps and sign-significate 
relations . 

Several of these writers (e.g., Hiller I960; Polya, 1962, 1965; 

wewell ^ 1958) have dealt with problem solving in its full complexity. 

Emphasizing the role of heuristics — broadly applicable modi operandi they 
have been either of the opinion that problem solving should be treated as 
an art (Polya, 1962, 1965) or that the computer (^^iller ^ I960.; Newell 

et 1953) provides the only really effective means for dealing with the 
complexities involved. The former vievj, of course, is antithetical to 
science. Computers, on the other hand, although they provide a valuable 
tool and possibly a viable model, do not alleviate the scientist of the 
responsibility for identifying the basic behavioral units and stripping 
theory of nonessentials. Computer simulation, due to the technical com* 
plexities and practical problems Involved, may be as much a hindrance a a a 
help in theory construction. 

Others (e.g., Ausuoel, 1963; Bartlett, 1930; 1953: Dienes, 1963.. 
Piaget, as described in Flavell, 1963) have also offered appealing analyses 
of meaningful learning and problem solving, but they have been forced to 
gloss over many subtleties. There has been no sufficiently precise lan- 
guage available for formulating their ideas. Although Piaget has made 
considerable use of logic in his theoretical work, it has served primarily 
to describe Internal capabilities. In tieing these capabilities to observ- 
ables, Piaget has simply used the French language, sometimes in rather 
abstruse fashion. 

In short, the choice, to date, has been between a precise, but seem- 
ingly inappropriate S-R language, and presumably more relevant cognitive 
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formulatlons which leave much to be desired In so far as scientific 

coheslveness and rigor Is concerned. 

Gagne’s (1962, 355) statement to the effect that knowledge allows one 
to "perform successfully on an entire class of specific tasks, rather than 
simply on one member of the class” Is Indicative of the fundamental differ- 
ence between meaningful and rote learning. Knowing how to add means that 
the learner Is able to give the correct response to any addition problem, 
not just one. vJhlle not necessarily Impossible for the neo-associationist 
to explain how ^s can give new responses to new stimuli, it aoes present 
difficulties. 2 In fact, the explanations offered (e.g., Berlyne, 1965, 
168-171) are not really explanations but independent postulates. 

Although Gagne <1964, 1965) has taken great pains to indicate how 
higher forms of learning, such as the principle , depend in turn on lower 
forms, such as the association , there remains the possibility that a higher 
form may be basic while the lower forms are simply special cases. Tracy 
Kendler (1964) has alluded to this possibility when, in reviewing Gagne’s 
(1964) paper, she suggested that new properties may emerge at the principle 
level. Gagne’s (1964) original representation of the principle did not 'jso 
the S-k language. He (Gagne, 1965) has since attempted to reconcile this 
difficulty, by viev/ing a principle as a chain of concepts. The analogy, 
however, is somewhat strained and in no way debases the suggested alter- 
native. 

Principles appear to be involved, in every act. We determine the sum 
of a number series according to some well learned rule or plan. We open 
the door when soraeone knocks, except possibly when working on a manuscript. 
Even as experimental subjects, we spew as many associates to a stimulus as 

^See the discussions below concerning stimulus-response generalization. 
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we can according to some plan of operation— a plan frequently introduced 
by the experimenter. 

According to Webster a principle may be defined as follows i "an 
underlying faculty or endowment; a governing law of conduct: opinion, 

attitude, or belief that exercises a directing influence on ^ife and 
behavior, laws or facts of nature underlying the v/orking of an ertificiai 
device (or human). *3 In short, the term principle can ba use^ to refer to 
that which underlies behavior. 

The purpose of this paper is to describe a precise (motb^matlcal) 
set-function language (SFL), based on the notion of a principle, which 
appears to capture much of what has been proposed as central to meaningful 
learning. The paper has been organized as follows. First, the basic 
rationale is described and the principle is given a precise mathematical 
characterizatlonj, largely in terras of sets and functions. Second, the S-R 

V 

and SFL languages are contrasted. Third, empirical research, which was 
based on a preliminary formulation of this language and which has aided 
in its development, is described. Fourth-, problems concerning reception 
and discovery learning, reversal and nonreversal shifts, Piagatian conser- 
vation tasks, and symbolic and concrete learning, are reformulated In the 
SFL. Fifth, theoretical direction is given. Finally, some concluding 
remarks are made. 




^The parentheses are mine. 
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THE PRINCIPLE— BASIC UNIT IN 
THE SET-FUNCTION LANGUAGE (SFL) 

To provide motivation for the mathematical characterization given 

below, consider the following situations: (1) a young child learns to 

say ''16'* when he is presented with the nominal stimulus "1 + 3 + 5 + 7’*— 

i.e., he learns to say the sum of the series represented (although he may 

not know what a sum is), (2) an algebra student learns to give the sura '16'* 

when shown any one of several representations of the series 1 + 3 + 5 + 7 

3 

(e.g., 1 + 3 + 5 + 7," “ I (l+2m)," etc.), and (3) a college student 

m*0 

learns a rule for finding the sum of any arithmetic series. 

Before discussing differences, consider first what these situations 

have in common. In each case, learning may take place in either one of 

two ways. The to-be-leamed material can be presented directly, as in 

reception learning, or the material may be learned by discovery. In the 

three situations described, the following statements wight serve to promote 

reception learning; (1) "If shown *1 + 3 + 5 -f 7/ then *X6*, and 

(2) "If shown any representation of the series 1 + 3 + 5 7^ then say '16', 

and (3) "If shown (any representation of) any arithmet^^ series, then say 

the numeral corresponding to (A L) ?g where A Is the first, L the last, and 

2 

N the number of terms, in the series'." In ac(^uirtng such abilities by dis- 
covery, the stimulus-response pairs involved may be presented, in turn, 
until the learner can correctly anticipate the nesf.4: response to a new 
stimulus within the same class. To acquire an underlying princip-1e, S 
must encode j^tli the stimulus and the response and discover a colmaon 
relationship between them. In situation one, of course, this felatlonship 
is simply a direct connection between the internalized stimulus and the 
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Internallzed response.^ In th^ other situations, the relationships are 
between the elements common to each representation o£ the related stimuli 
and those common to the responses. In situation two, the series, 1 + 3 + 

5 + 7, is an abstract entity (property) common to each stimulus, the commor 

response property is simply the internal representation of 16." In 

refer 

situation three, the common properties/^ t<> series of the form a + (a+d) + 
(a+2d) + ... + (a + (n-l)d) and numerical sums, respectively. 

The second characteristic common to these situations Involves perform- 
ance. Making appropriate use of knowledge, implies that an information 
processor is able to determine, not only what response to give to a class 
of stimuli, but also when to apply what he knows. To accomplish this the 
stimulus situation must be encoded in a manner appropriate to the context, 
the operation must be carried out, and the results must be made observable. 

The general context within which | selectj ^^laiPds presumably pro- 

vided either by directions or by Internal stimulation of some sort.^ 

Within a given context, however, stimulus properties determine which 
principle is appropriate. 

Suppose the Inputs of concern are general number series of the form 
ai + a 2 + ... + Within this context, the principle involved in situa- 
tion one, would be applied whenever "1 + 3 + 5 + 7" appeared. In situation 
two, any representation ofl + 3 + 5 + 7 would do. The principle, associ- 

^In this situation, each presentation of the single pair involved would have 
to count as a different pair— in fact, they are different if only in pre- 
sentation time. 

^It is possible that directions and Internal stimulation may be viewed as 
higher order principles of some sort, principles which narrow the range of 
subordinate principles which might be evoked in a given situation. This 
possibility is discussed to some extent in later sections, but detailed 
consideration is beyond the scope of this paper. 
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ated with situation three, would be applied to ar»y arithmetic series, 
number series in which a common difference exists between adjacent terms. 
Furthermore, all of the common /features need not be involved in determining 
the appropriate response to a given stimulus. In situation three, the 
first, last, and number of terms in the series would serve this purpose. 

In effect, encoding a stimulus may serve two distinct functions, to identify 
the appropriate principle and to determine responses. 

In many ways the problem of ascertaining what will be encoded Is 
easier with meaningful than with nonsense materials. Uith meaningful 
materials, because we have some basis for knowing what prior knowledge will 
be brought to bear on the decoding task, one can frequently assume that 
Inputs will be encoded in particular ways and that internal representations 
of responses can be made observable. For example, "5 + 2 x 3” will 
undoubtedly be Interpreted by the reader as "5,** "2 x 3 and not 

•’5 ^ 2,” ”x,'* ”3.'* The reason for this phenomenon does not reside in the 
stimulus but is based on a previously learned convention. Similarly, all 
verbal Sh can presumably write (and say) the numeral ''6” representing the 
(internalized) number 6. 

The nature of the relationship between the underlying pr:^nciples and 
the observable stimuli and responses also appears to be the same in all 
three situations. Whether the learning be by reception or by discovery, 
presenting test stimuli within and outside the scope of particular princi- 
ples would appear sufficient to determine what is learned. Since more than 
one principle may lead to the same response, however, the problem is not 
quite that simple. Questions concerning assessment are discussed in more 
detail in a later section. 

Although learning and performance are of an apparently similar nature 



in the three situations described, there are important differerxes. In 
situation one, the relevant attributes could be almost any combination of 
salient features in ”1 + 3 + 5 + 7." This is also true of the combining 
rule selected. A likely possibility, however, would be a simple mapping 
(i.e., association) between what is encoded and the Internal representatlo; 
of 16. In situation two, the relevant attributes may be common to each 
representation of the series 1 + 3 + 5 + 7. If only number series are 
considered, both of the sets [1, 3, 5, 7] and [1 (first term), 7 (last 
term), 4 (number of terms), 2 (common difference)] would serve to character 
Ize these communalitles. That none of these properties (e.g., number of 
terms, 4) can be used to distinguish between the various possible series 
representations, suggests that these properties are to be associated with 
sets of stimuli and not the stimuli themselves.^ 

Either set of common attributes can be used to determine the appropri- 
ate sum response. The terms 1, 3, 5, and 7 can be added sequentially and 

the expression ' ^1 (first term) + 7 (last term) 4 (number of terms)" can 

2 

be simplified. In effect, the se situations are entirely analogous to 

learning concepts which involve both attribute Identification and rule 
learning (Haygood & Bourne, 1965). 



^In mathematics, properties are often equated with sets of elements which 
have these properties. These elements, in turn,, are sets. Thus, the 
number two is defined as that collection of sets each having two elements 
(e.g., oranges, apples, things). The algebraic symbol, N, is a still 
higher order (more abstract) property and refers to a higher order 
collection which contains the various collections to which the Integers 
refer. In the same way, the series, 1 + 3 + 5 + 7, also refers to a class 
of stimuli— -those, for example, corresponding to the various ways of 
representing 1 + 3 + 5 + 7 (in numerals or words; with pencil, pen, type- 
writer, etc.). Even a property such as "red" is equated with a set of 
elements rather than attributed to a stimulus object* In most situations, 
of course, this distinction between a property of an object and the property 
Itself (which is equated with a class) can be Ignored. But there are other 
circumstances where the distinction is Important (e.g., whenever the 
psychologist wishes to distinguish between an (overt) stimulus property and ^ 
Internal mediating response (property) Itself. In what follows, I shall 
Ignore this distinction where the intended meaning is clear. 



Qr* » c 

In situation three, the relevant attributes are common to all/serics. 
Again, the communalities can be characterized by at least two sets of 
properties, * A, a^ • L, W, D* The difference 

between this and situation two, however, is more fundamental than simply 
involving a larger class of related stimuli. The responses also vary . The 
internal representation of the responses also is a property relating to a 
whole class of (sum) responses. The introduction of a response dimension, 
in addition to the stimulus dimensions existing in situation two, results 
in stimulus properties of a higher order in situation three in the follow- 
ing sense. These latter properties refer to collections (sets) of sets of 
arithmetic series, each of which, in turn, has a variety of symbolic repre- 
sentations. The first term. A, -or example, is a property of the class of 
those sets in which the arithmetic series have a common first tenn. A 

single combining rule (e.g., (AJKJi)^). by which the sum of any arithmetic 

2 

series may be obtained, could provide a basis ^t»r responding as indicated 
in situation three. This rule involves all that is common to a class of 
rules for determining particular sums (e.g., (1 •f 7) 4, (3 + 9)15, etc.). 

In short. A, L, W, and (A + L) m are of a higher order than 1, 7, 4, and 

2 7 

(1 + 7)4. respectively. They are said to be more abstract.' 

2 

In summary, learning may occur by either the direct interpretation 
of given information, such as a statement of principle, or by discovery, 
the abstraction of a relationship common to a whole class of stimuli and 



^Notice that it is easier to detect a particular stimulus property when 
presented with a lov 7 e?* order symbol, such as ”1,'’ than one of higher order, 
such as "A.*‘ Being able to determine any value of A, for example, 
necessarily implies that ”1,'* ”2," etc., can all be detern^med. Higher 
order rules and properties may be fundamental to most human behavior — 
particularly with respect to such subject matters as mathematics. 
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responses. In each case, the basic behavioral unit is an 

internalized plan of action whereby an information processor— human, rat, 
or computer— may determine which responses to make to which stimuli. A 
principle is not directl.. observable and its presence can only be inferred 
from the behavior resulting fron operating on particular stimuli. All 
observable inputs pertaining to human learning are not designed to elicit 
responses. Inputs may also serve to promote learning and to indicate the 
context in which principles are used. The presentations both of a statement 
of principle and of an S-R pair in paired-associate learning provide illus- 
trations of the former type of input; directions provide an illustration 
of the latter. 

In spite of these communalities , the nature of what is learned may 
differ. The learning may represent a simple relationship between the 
internal representation of a single stimulus and a single response; a 
relationship between representations (set propert/<i) of a set of stimuli 
and a single response; or a relationship between representations of a set 
of stimuli and a set of responses. In effect, two types of generality may 
be involved. The principle may refer to one stimulus and one response, a 
class of stimuli and one response, or a class of stimuli and a class of 
responses. In the latter situation, a second level of abstraction is 
imposed on the stimulus properties. 

Principles, denotations, and statements can be given more precise 
mathematical definitions. In order to distinguish one principle from 
another the following must be specs-fledi (1) the set of identifying 
properties (I), which make it possible to determine when the principle is 
to be applied, (2) the set of response determining properties (D), from 
which the response is derived, (3) the set of response properties (R) , 
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and (4) the combining rule, napping, or operation (0) by which R is derive*^ 
uniquely from D. In short, a principle may be characterized as an ordered 
four-tuple, (I, D, R, 0) where I, D, R, and 0 are defined as above.® 

The denotation of a principle consists simply of the set of correspond- 
ing Instances, the observable S-R pairs. This set may be denoted. [(S^, R^j^) 1 

i = 1, 2 and (Sjj,Ra) and (Sit,Rb) implies R^ ® Rbl- This definition 

implies that there can be only one response paired with each stlmulus—the 
denotation of a principle simply being a function^ in which the domain is 

®A principle may also be characterized in terms of description spaces 
(Hunt. 1962). Essentially, a description space is simply a product space 
in which the one dimensional component spaces are stimulus dimensions. 

Each dimension is partitioned into categories. For example, Consider a 
description space consisting of the three dimensions, size, color, and 
shape, with the values (categories) large and small, black, white, and 
green, and triangle, respectively. Any stimulus object with some combina- 
tion of these values may be placed in one of the resulting six (2x3x1) 
categories. 

A derived description space is simply another description space derived 
from the first by mapping points (descriptions) in the first space into a 
second space with at most the same number of dimensions and values. The 
derived dimensions and values may be identical or different from those in 
the original space. The set of properties and dimensions, D, corresponds 
to a description space; R corresponds to a derived space; and 0 corresponds 
to the mapping from one to the other. Notice also that Identifying the 
properties in I may depend on ’*operating' on lower order stimulus charac- 
teristics. Thus, a series is arithmetic only if certain properties hold- - 
e.g., the difference between adjacent terms Is a constant. Although it 
may ultimately prove useful to treat I as a derived space, along with R, 
this does not appear to be necessarv in most situations and will not be 
considered further in this paper. 

This characterization in terms of description spaces has cert Ain.* Advan- 
tages; it is precise, can be represented graphically, and emphasizes the 
distinctions and relationships bt;tween values and dimensions. It has the 
disadvantage, however, of requiring somewhat more mathematical background 
than is necessary for present purposes. For this reason and because of 
space limitations, no further use of these notions is made in what follows. 

^I^athematlcally, a function is a set of ordered pairs, (x, y), in which 
each value of the first variable, x, is paired with only one value of 
the second, y. 










the set of all observable stiuiull associated with the principle and the 
range Includes all of the observable responses. 

A statement must faithfully reflect the corresponding principle. To 
provide an adequate symbolic (observable) representation, it should make 
reference to all that characterizes the principle. This may be accomplished 
by a statement of the form. If I*, then R* • where D*, R', 

and 0* are symbolic representations for I, D, R, and 0, respectively.^® 
Primes are not used in what follows, except where necessary for clarity. 
Although statements can be given in different forms and such differences 
may be important, especially at early developmental levels, sitbsequent 
discussions are limited primarily to this form. 

To prove useful, a new scientific language must lu some substantive 
way represent an improvement over existing formulations. First, it may 
provide a more precise and parsimonious basis for describing Important 
phenomena than do existing languages. Second, it nay lead one to ask 
new and important questions and help make it possible to reformulate 
existing questions in more researchable form. If a new language should 
do only as well, and no better in both of these respects, one can seriously 
question the advisability of changing the scientist’s frame of reference. 
This is particularly true in the present case where the approach taken 
does not/represent a superficial departure or elaboration of .an existing, 
and apparently useful, language. 

In the following sections, SFL and neo-associationistic (S-R) formula- 
tions are compared and research suggested by the SFL is described. 

^®The role symbolism plays in learning and interpreting principles is 
discussed in a later section. 
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COliPARISOil OF SFL AND S-R LANGUAGES 

Although it also is tied inextricably to observables^ and is therefore 
behavioristic, the SFL is not ' associationistic, ’ nor even *'aeo-associa- 
tionistic, in the traditional sense of these terms (e.g., Berlyne, 1965). 

In the SFL, a clear distinction has been made between observables (e.g., 
denotations and statements) which induce or reflect internal motivations ur 
modi operand! and those Inputs on which such internalizations operate. To 
be sure, the organism is thought to operate on the stimuli; the stimuli do 
not operate on the organism (although, of course, they provide the occasion 
for making particular responses). 

In order to consider meaningful materials and the important role of 
Internal stimulation (motivation) , S-R analysts have chosen to extend 
empirically determined properties of overt stimuli and responses to medi- 
ating processes. In effect, the constructs are assumed to have the same 
properties as the observable phenomena they are seeking to explain. It may 
be this sort of circularity that has led some writers (e.g., Anderson, 1964 i 
Fodor, 1965) to severely question the explanatory power of intervening 
stimuli and responses. 

In short, the crucial difference between SFL and S-R approaches 
appears to be the nature of the construct used to tie observables togetnf^r. 
In the former, the association is the fundamental building block; in the 
latter, the principle is basic. 

Principle versus Mediating S-R Link . - Fundamentally, both principles 
and mediating S-R links represent a learned connection between one set of 
observables (stimuli) and another (responses). Mediating links, however, 
are often combined into chains of indeterminate length and various inter- 
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connections between different chains are often postulated for explanatory 
purposes. In the SFL, although a counterpart to chaining exists, only 
one-stage connect loiis are necessary* 

Of perhaps more concern, mediating S-R links refer to only one type 
of connection— associations between stimuli or mediating stimuli and 
mediating responses or responses* The combining operation, 0, referred to 
In the SFL characterization of a principle, is more general and refers to 
a mapping between mediating responses (e.g., D) and their stlmuUis 
properties (e.g., R). The almost exclusive use of the association in 
representing simple learning, both animal and human, has not greatly 
hindered American behaviorlsts since general rules or mappings have played 
no Important part in their research. Even research on concept learning 
has been largely limited to problems which have lent themselves to S-R 
mediation arguments. It has only been very recently that the rule learning 
aspects of concept learning, for which no ready associatlonlstic representa- 
tion is available, have been dealt with explicitly (Haygood & Bourne, 1965). 
These authors Independently recognized that concept learning Involved both 
(stimulus) attribute identification and rule learning. The present argu- 
ment, of course, is that rule learning plays an even more important role 
in the learning of principles which relate to a class of responses (a,^ 
well as a class of stimuli). 

Another major difference between the SFL and S-R languages involves 

the Internal representations of stimuli* In the SFL, certain aspects of 

the stimulus may serve to cue the principle and other aspects may determine 

the response. With respect to the principle, "If arithmetic (l.e., there 

exists a common difference), then sum equals (A 4- L) w.” for example, the 

2 

identifying (common difference) and response determining (A, L, and N) 
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characteristics of arithmetic series are distinct. In the S-R language nc 
such distinction is made; the same functional stimulus serves both functions. 

Distinguishing between the roles played by stimulus properties forces 
stimulus^ response* and etimulus^response generalization, as originally 
postulated by Hull (1943), to be viewed in new light. Generalization 
phenomena are provided a rigorous basis in terms of *‘what is learned. 

The properties in I determine the variety of stimuli to which a principle 
l 3 applied. The fewer characteristics required to identify a principle, 
the more widely applicable the corresponding principle will be. This 
follows directly from the fact that the number of eligible stimuli varies 
inversely with the number of required properties (i.e., restrictions) 
Imposed. The properties in D, on the other hand, determine what the 
corresponding responses are. The number of responses involved depends on 
the numbei ’ f response determining dimensions (n) and the number of 
distinguishable values of each. Each n-tuple of determining cues (one for 
each dimension) corresponds to one response. For example, the properties 
(i.e., dimensions) *'color’' and "shape" (in D) would result in a larger 
number of responses than "color" alone. The more different stimulus 
properties referred to by the encoded dimensions, the larger the denotation 
of the principle. 

It is Important to note that D and 0 may make it possible to determine 
responses (that may not be correct) to stimuli outside the scope of a given 
principle. For example, leads to the incorrect ''sum'' 9 to the series 
2 4 4* 6. In an analogous fashion the properties in I may overly restrict 

the set of stimuli to which a rule is applied. The SFL characterization of 
a principle provides, in effect, a basis for independently manipulating 
those cues (I) responsible for generalization (and discrimination) on the 
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Btlnulus side and those cues (D) responsible for generalization on the 
response side.^^ 

As we shall see below > the stimulus identifying and response deter- 
mining cues are Identical with respect to the learning of associations 
and concepts, which have provided the subject matter for the vast majority 
of experimental studies* Since the S“R language makes no explicit provi- 
sion for dealing with the phenomena, it is not surprising as Beriyne 
(1965, 169) recently points out, "Stimulus-response generalization, in 
which the formation of an association between S^ and results in the 
formation of an association between S 2 and R|^, has, ••*, scarcely been 
investigated at all.” This is hardly a desirable state of affairs if 
principles are as critically Involved in meaningful learning as has been 
proposed (e.g., Gagne, 1962, 1965; Scandura, 1967, 1967). 

lAiile on the topic of generalization, I might add that the present 
concern with discrete variables is not as limiting as might be expected. 
First of all, the examples chosen suggest that the critical values of 
meanin'^ful stimuli can often be readily distinguished. The numeral ' 6 
represents the number six and not five. Problems involving "just notice- 
able differences (jnd),” as in psycho-physics, are rarely of primary 
concern with subject matters like mathematics and probably others as well. 
In some ways, the analysis of actual subject matters may be less difficult 



^4he third type of generalization postulated by Hull (1943), response 
generalization, may in reality be nothing more than stimulus-response 
generalization. Although maintaining Hull's (1943) original trichotomy, 
Beriyne, (1965), admits that differences between response generalization 
stimulus-response generalization may be primarily a matter of conven- 
tion. In view of the preceding discussion, in which counterparts for 
stimulus and stimulus-response, but not response, generalization were 
Identified, it may be advisable to map this theoretical trichotomy into 
a dichotomy. While not denying the fact of response generalization, I 
am questioning its theoretical necessity. 
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and more precise than that of nonsense syllables. Second > the generaliza- 
tion gradient, almost always noted when training and transfer stimuli 
differ along a continuous dimension, may be an artifact due to averaging 
over different Jnd. If this latter supposition proves sound, it could have 
Important Implications for the implicitly assumed probabilistic basis for 
behavior. It may be that probability enters the situation largely because 
the properties in the set 1 have not been identified. 

Data reported in the next section suggest that there Is no decrement 
in performance on training and transfer stimuli (within the scope of a 
principle) when the stimulus cues are discrete and readily determinable. 

Classification of learning types * - Current Interest In taxonomy 
development has been Intense (e«g«, Melton, 1964; Stolurow, 1964). More 
important, there has been a concerted effort to uncover basic similarities 
between what have heretofore been considered separate categories (e.g., 
Fitts, 1964; Gagne, 1964, 1965). The emphasis has been towards genotypic, 
rather than phenotypic, bases* 

Perhaps the most encompassing classification scheme of this sort, 
which is based primarily on S-R terminology, is one proposed recently by 
Gagne (1964, 1965). This taxonomy provides a natural, and I think funda- 
mental, basis for comparing the S-R and SFL languages. The association 
is the basic building block In Gagne's formulation; most of the higher 
forms of learning represent conq>llcatlons of this unit* When reformulated 
in the SFL, the principle becomes basic and the simpler forms special cases. 

Gagne (1964, 1965) identifies eight types of learning: (1) signal 

learning— the establishment of a conditioned response which Is general, 
diffuse, and emotional, and not under voluntary control, to some signal, 

(2) S-R learning— making very precise movements, under voluntary control. 
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to very specific stiiauli, (3) chaining— connecting together in a sequence 
two (or more) previously learned S-R pairs, (4) verbal association— a 
subvariety of chaining in which verbal stimuli and responses are involved, 
(5) multiple discrimination— learning a set of distinct chains which are 
free of interference, (6) concept learning— learning to respond to stimuli 
in terms of abstracted properties like "color,'* "shape," and "number," 

(7) principle learning— acquiring the "idea" involved in such propositions 
as "if A, then B" where A and B are concepts; a chain or relationship 
between concepts, internal representations (of concepts) rather than observ- 
ables being linked, (8) problem solving— combining old principles so as 
to form new ones. 

According to Gagne (1965, 30-31), these varieties of learning were 
determined in accordance with the conditions required to bring them about. 
Thus, for example, the preconditions for signal learning are the nearly 
simultaneous presentation of two forms of stimulation, UCS and CS. Those 
for principle learning are the prior learning of related concepts and the 
chaining of these concepts. 

This scheme has the advantages of having a practically Important base 
and of at least formally relating simple learning types with more complex 
forms like Gagne's principle (GP). (To minimize the possibility of 
confusion 1 shall use GP to refer to the sense in which Gagne has used the 
term principle). Still, there are Important limitations. Gagne's original 
representation (1964) of principles and problem solving did not use the 
S-R language. In his later formulation (Gagne, 1965) he has attempted 
to show similarities by considering principles to be chains of concepts, 
but the analogy is somewhat strained. Concepts are not directly observ- 
able whereas the S-R links in a chain can supposedly be made so. Using 
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chaining inechdnisiiis in both situations may indicata sitsilaritias which arc 
more apparent than real* 

No such difficulties are encountered when the learning types are 
represented in the SFL with the counterpart of GP taken as basic. GP may 
be characterized (I, D, 0, R) where the properties in I 

and those in D refer to stimulus dimensions (i.e., higher 

order properties) » 0 refers to a class of operations (i.e*» an abstract 
operation relating a set of stimuli and a set of responses), and R refers 
to a class of responses. The denotation of such a principle, symbolized 
[(S^, Ri)| i - 1, 2, ... and for all S]^,(Sk, Kg) and (Sj^, R|,) implies 
Ra “ ^bl » consists of a set of S-R pairs with a variety of different 
responses and with each stimulus paired with only one response. The repre- 
sentation of a principle statement in the SFL, **If I, then R “ 0(D), “ is 
of the same form, but more detailed than that, ’If A, then B,** used by 
Gagne (1964, 1965). 

The other types of learning, identified by Gagne (1964, 1965), turn 
out to be either special cases or a composite of principles. Concepts are 
simply principles in which the properties in I “ D refer to a class of 
stimuli, but are not dimensions, and those in R refer to a single response. 
Although not a necessary condition, . 0 is simply a logical rule for combining 
relevant attributes HSi most studies of concept learning ’(e.g. , Haygood & 
Bourne, 1965; Bruner, Gooduow, & Austin, 1956; Hunt, 1962). The denotation 
of a concept, symbolized ((Sj^, Ri)|Ri i“l» 2, ***], consists of 
a set of S-R pairs where R is a constant. 

^^All many-one S-R pairings are not concepts in this sense. They may also 
consist of a number of discrete S-R associations with only one response. 
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1 ^ 

In order to represent simple S-R associations and sign learning,**-^ 
the SFL formulation is restricted still further. The (properties in) 

I - D refer to a single stimulus, R to a single response, and 0 to a 
simple mapping (l.e., association) betv/een D and R» In this case, the 
denotation is a set consisting of a single S-R pair. 

Notice that in much association learning, the response learning phase 
(l.e., the connection between mediating stimuli and overt responses) is 
of considerable importance (e.g., Underwood & Schultz, 1960). Of primary 
concern in higher forms of learning are the identification of critical 
stimulus cues (l*e., I and D which correspond to mediating responses) and 
learning the combining rule. The internal R-overt response pairing, such 
as between numbers (internal) and numerals (overt) , are typically well- 
leamed prior to concept and principle learning. Nonetheless, the 
application of a learned principle (statement) involves the perception 
(or determination) of the relevant stimulus properties which, in turn, are 
transformed into internal response properties (which previously have been 
tied to observable responses) . In those cases where ^ is unable to 
determine the critical stimulus properties, carry out the rule, and make 
the results of the rule observable, successful application would be 
impossible. Thus, for example, applying ^ principle, where 0 = (^l±Ji)n 
and D » (A, L, N), to find the sum of an arithmetic series, involves 

^^Slgn learning can be viewed as a principle which might be stated, *'If a 
light appears, then elicit a worry response (since a shock will follow). 
Clearly an animal cannot learn a verbal statement of this sort, but he can 
operate according to some such plan. There seems to be no ^ PJEiSEi reason 
why animal, as well as human, behavior cannot be viewed in terms of 
principles. On the other hand, there may be no important reason to do so 
since animals have no language system for transmitting principles with all 
that undoubtedly implies for complex forms of learning and behavior. 
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determining the properties in D, The determination of N, however, has 
proved to be a non-*trlvial task for junior high school ^s (Scandura, 
Woodward, & Lee, 1965). 

Wot having acquired the necessary 'perceptual "(vr operation, response 

learning) skills required to apply a particular principle, however, does not 
necessarily Imply that S cannot make use of a different principle having 
the same observable characteristics (denotation). It is often possible lo 
start at a lower level— with more readily discernible (stimulus) proper- 
ties. The number of terms, K, in an arithmetic series, for example, can 

be derived from the formula 0* i (L C - A) . where D' « (A, L, C) and C 

C 

is the common difference between adjacent terms, C presumably being easier 

to determine than N. In this case, the derived rule, 0” = (A^ (LfC;:^, 

2 C 

where D* » (A, L, C)^ can be viewed as a composit e operation with 0"(D') = 
0(0*(D')1 = 0(D). In this formulation, the operation O', of course, must 
be viewed as mapping each value of A, L, and C in D' into the corresponding 
values of A, L, and N in D. A composite operation, of course, has the 
same properties as any other operation. Unless the responses i.’orrespond- 
ing to each constituent operation rreactually observed, guessing the stages 



l^According to this view, determining characteristics and perceiving 
characteristiCG are one and the same process— the difference simply being 
one of degree. Thus, just as ’*16" can be derived from '’1," ”7," and '‘4," 

"4" can be derived by counting the numerals in "1 + 3 + 5 + 7"— the numerals 
being still more easily discernible (th^ "4"). 

In most experimental studies, ‘Involving percepfloh \ d ir ectl y , the 

concern has been primarily with less abstract characterisHcs— typically 
physical properti.es of the stimulus. In such cases the derivation rules 
are probably well learned. Still, all ^s at one time or other had to 
learn how to determine even physical characteristics. The new-born infant 
presumably is very limited in its abillf''^ to decode Informat:’ on. Perhaps, 
it is the few such abilities that it does have, such as withdrawal 
tendencies from aversive stimulation, which provide a basis for determin- 
ing higher-order abilities. 
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a learner went through would appear to serve no useful purpose 

The ability of the SFL to effectively represent multiple discrimina- 
tion learning depends on the principle identifying properties* Presumably, 
a number of discrete S-R pairs become well-learned and interference free 
when the sets, I, of identifying properties, corresponding to the distinct 
pairs, are disjoint* The denotation of multiple-discrimination learning 
would consist of a fiuaber ofcdistlnctvonat^pdir-isets* 

Notice that the principles Involved in multiple-discrimination 
learning can be of a more general sort. Moreover, such principles can 
relate to a superordinate principle. The statement, "If a number series, 
then the sums may be obtained by sequential addition," in some sense refert; 
to a principle of higher order than do the statements, ”If an arithmetic 

series, then sum ■ (A 4* and **If geometric, then etc* 

2 1-R 

It is presumably this sort of knowledge that Gagne (1964, 1965) has in 



^^Composite operations appear to correspond to the notion of chaining in 
the S-R mediation language but as suggested above the former correspond 
to connections between mediating responses and their stimulus properties 
rather than between stimuli and responses. 

l^It is worth noting that a single principle may refer to the same 
stimulus class as do a number of more restrictive principles taken collec- 
tively. The principles, "If large and black , then (the response varies 
with) shape" and "If large and white , then shape," may be considered 
special cases of the principle, "If large, then color and shape." The 
latter, more general, principle at once has fewer identifying cues and 
more response determining dimensions than the other two. In effect, it 
appears that critical response dimensions are traded off with critical 
principle Identifying cues. The more general the principle, the more 
stimulus attributes vary with the responses; the more specific the principle, 
the more stimulus properties are required to identify the principle. The 
total number of critical properties remains constant. 
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mind when he refers to problem solving. 

This discussion provides another powerful argument for adopting the 
principle, rather than the association, as the basic unit of behavior in 
meaningful learning. Dot only is the principle able to account for 
structure, without postulating complicated chains and hierarchies of 
unobservables, but the principle has the Important advantage of mathematical 
parsimony. When stripped of non-essentials, the set of observable stimuli 
and responses, which constitute the denotation of a principle, corresponds 
exactly to the mathematical definition of a function. The association, 
on the other hand, is a special case. 

The role of directions . - Directions serve an Important role in 

t 

almost all experimentation with humans. With meaningful learning, this 
role may be critical (e.g.. Haler, 1930; Gagne, 1962, 1964). According 
to Gagne (1964, 305), directions may serve to: (1) identify the terminal 

1 ^ performance required, (2) Identify parts of the stimulus situation, (3) 

aid the recall of relevant subordinate performance capabilities, and 
» (4) channel thinking. The characterization of a principle as an ordered 

four-tuple (1, D, 0, R) reflects each of these functions. R refers to 
the desired class of responses; I and D refer to the critical stimulus 
cues; and information about 0 may serve to aid recall and to channel 
thinking. Giving ’’complete" directions, of course, would amount to 
stating the principle. 

^^In contrasting Gagne's classification scheme with the present formula- 
tion it is well to keep in mind the difference in purpose. Gagne (1964, 
1965) was concerned with classifying learning according to the necessary 
and sufficient conditions for its occurrence. Present concern has been to 
show how the SFL provides a valid base for formulating each of these learn- 
ing types. Although the preceding discussion suggests that the necessary 
preconditions for learning can also be derived from the SFL formulation, 
further discussion is beyond the scope of this paper. 
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in other situations, however, it appears that directions serve a 
higher order role. They seem simply to define the context, to limit the 
number of principles which might be evoked in the given situation. This 
function Is served, for example, when ^ Is told to find sums, thereby 
making It possible for him to restrict his attention. Such directions 
would appear to serve a motivational function^ it Is by such direction 
that psychologists attempt to manipulate the motivational factors involved 
In learning. Detailed consideration of the underlying mechanisms is 
beyond the scope of this paper but they would appear to be compatible 



with the SFL. 



EMPIRICAL RESEARCH 



The research reported in this section was formulated and completed 
prior to the development of the SFL in its present; but still preliminary, 
form. In the beginning, there was just a vague feeling that a new approach 
to meaningful learning was needed. The later research was based on a 
preliminary formulation of principles solely in terms of their denota- 
tions. I have tried to indicate both the chronology of this development 
and some of the pitfalls which helped to shape my thinking. 

Pilot Research on Response Consistency . - During the summer of 1962, 
Greeno and Scandura (1966) found, in a verbal concept learning situation, 
that essentially S either gives the correct response the first time he 
sees a transfer stimulus or the transfer item is learned as its control. 

I later reasoned that if transfer obtains on the first trial, if at 
all, then responses to additional transfer items, .under' certain conditions, 
should be contingent on the response given to the first transfer stimulus. 
In effect, a first transfer stimulus could serve as a test to determine 
what had been learned during stage one, thereby making it possible to 
predict what response ^ would give to a second transfer stimulus. 

To test this assumption, I had a number of pre- and post-doctoral 
psychologists over-leam a short list of S-R pairs like that shown in 
Figure 1. Prior to learning the list, both the Ss and E agreed on the 
relevant values and dimensions— size (large-small), color (black-white), 

l^The learning (list 1) and transfer and control (list 2) stimuli were 
Underwood and Richardson (1956) nouns, the responses were nonsense 
syllables, and the lists were learned by a self-paced anticipation method. 
The transfer stimuli belonged to the same concept category as one subset 
of the learning stimuli. 
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LEARNING 



TEST ONE 



TEST TWO 




*♦- black 




A 



-► large 





A 



-► white 




“► small 



Fig. 1. Sample learnings assessment (test one), and prediction 
(test two) stimuli and responses. 



and shape (circle, triangle) • The ^s were told to learn the pairs in the 
most efficient maauer possible as this might make it possible to respond 
appropriately to the transfer stimuli. After learning, the test one 
stimuli were presented and the ^s were told to respond on the basis of 
what they had just learned. Positive reinforcement was given no matter 
what the response. The test two stimuli were presented in the same manner. 

The results were clear cut. Most of the £s gave the responses, 
black and large, respectively, to the two test one stimuli (see Fig. 1.), 
and those who did, almost invariably responded with white and small to the 
test two stimuli. On what basis could this happen? It was surely not a 
simple case of stimulus generalization; the responses given did not depend 
solely on common stimulus properties. The first test one stimulus, for 
example, is as much like the fourth learning stimulus as the first (see 
Fig. 1.). 

Perhaps the simplest interpretation of the obtained results, is that 
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nost of the Sb discovered the two underlying principles during list one 
learning and later applied them to the test stimuli* These principles 
might be stated, If (the stimulus is a) triangle, then (the response is 
the name of the) color" and 'If circle, then size." The former principle 
is characterized by letting I ■ (triangle, color), D ■ R ■ (color), and 
0 ■ identity mapping* The corresponding denotation would be [(S, R)|S is 
a colored triangle and R is the name of the S color]* 

The results obtained in this miniature pilot experiment (which I 
have repeated a number of times) provide support for the contention that 
principle learning is an all-or-none affair* The Ss either learned the 
principles (S-R relationships) or they didn't; there was no difference in 
test one and test two performance* Not all of the £s, however, learned 
the two principles Indicated above* Apparently, some of the £s ignored 
the similarities between the four pairs and learned them in rote fashion — 
i*e* » as four distinct principles, each involving an object and the 
corresponding word response* Under such circumstances, random test 
performance would be anticipated* 

The results of another pilot study, conducted at the University of 
Michigan during the summer of 1963 and reported by Scandura (1966) , were 
also revealing* In this case, Underwood & Richardson (1956), high dom- 
inance nouns were used, nouns which elicited an adjective associate with 
a frequency greater than 50SJ. Eleven college ^s over learned a list con- 
sisting of four piTirs of stimuli representing four adjective categories. 
Both stimuli In a given category were assigned a common response* The 
four test one and four test two stimuli were also high dominance nouns 
selected so as to represent each adjective category* The task was put 
in the context of a game and £ had the option of responding to the test 



stimuli with* "I don’t know," when none of the four learned responses 
seemed appropriate. Without this control, appropriate responding to a 
transfer stimulus would have occurred by chance in about one out of four 
cases. Again, positive reinforcement was given for all choices. 

The results with these concept materials were equally revealing. 

In '^se cases where transfer potential was indicated, the responses to 
the second set of stimuli conformed to prediction in 47 of 52 cases. 
Furthermore, when asked, all but two of these ^s correctly identified the 
common adjective as the basis for their test responses. 

Nonetheless, these results cannot be Interpreted as unambiguously 
as in the first pilot study. All of the Important stimulus dimensions, 
could not be identified a priori . In concept learning experiments, using 
the Underwood and Richardson (1956) materials, it may be desirable to 
assume that the set of determining properties, D, equals the common 
adjective. 



^^ith actual subject matters assessment sometimes presents additional 
problems. In the first place, it is not always easy to specify uniquely 
the basis for an overt response. There is usually more than one path to 
the goal. Consider, for example, a situation, which occurs in the Uni- 
versity of Illinois Committee on School Mathematics Program (Jack Basely, 
personal communication) , in which ^ is asked to compute 35 • 449 + 35 • 
551 as rapidly as possible. ^ can~laboriously multiply 35 times 449 and 
35 times 551 and then add the products or he can recognize this is an 
Instance where the distributive principle would allow him to compute 
35(449 + 551) « 35 • 1000 « 35,000. Clearly, it is not the sum alone 
which determines what is learned (l.e., the "way" in which the problem 
is solved), but the time it takes to respond. If the correct answer is 
given in a short time, the distributive principle was probably used. 
Giving the answer in a relatively long time would likely indicate the 
usual computational rule. In either case, the principle used would be 
reflected in the denotative response unit chosen. If S gives an incor- 
rect answer or if the problem is so easy that there would be little time 
differential no matter how S does the computations, further complications 
are Introduced. 

In short, the careful selection of test stimuli and responses is essen- 
tial in order to assess knowledge. Ideally, these elements should be 
chosen so as to eliminate all modi operandl but the one in question. 
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Principle Learning , - The question of relationships between S-R 
pairs seems so basic, and so obvious, that one wonders why it has not 
been studied extensively* Because it provides a simple context in which 
to contrast mediation and set-function formulations, the problem is 
described in some detail. 

Consider a paired-associate (PA) context in which the relationships 
between four pairs are varied while the other factors are held constant* 
In Figure 2j such a manipulation is accomplished by selecting the two 
principles indicated by, **If black, then shape** and **If white, then size* 



Insert Figure 2 about here 



In the experimental list, two pairs correspond to each of the two 



Although probably not attainable, this ideal can be approached in many cases* 

Another problem involved in work with actual subject matters is that of 
complexity. More than one principle may, and usually does, enter into a 
single test response. To determine the learning underlying the response, 
it is often necessary to assess each principle Individually, as in diagnos- 
tic work with school children. 

In many test situations, there are few available responses from which to 
choose (as in True-False and Multiple Choice tests). Under these conditions, 
there are additional problems of assessment since there is a high probabil- 
ity of giving any particular response (by guessing) irrespective of learn- 
Ing type* A similar problem obtains In assessing concept learning* There 
are at least three ways of minimizing this problem: (1) present more than 

one test stimulus, (2) include appropriate controls for comparison (e*g*, 
Greeno & Scandura, 1965), and (3) provide an alternative for guessirg as 
was done in the pilot study described above. 

^^The assessment methodology employed in this research may be used in 
conjunction with two types of variable: (1) those i^hlch affect the prob- 

ability of principle learning and (2) those which affect response consist- 
ency* Giving directions and presenting cues, hints, or other attention- 
getting devices provide examples of the former type of variable* The 
consistency with which ^ responds according to a learned principle may be 
influenced by feedback > as well as instruction variables, operating between 
the first and second test responses* 

A study dealing with the effects on principle learning of cueing various 
relevant stimulus properties is currently underway in our laboratory* 
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principles. The control stimulus properties (sizes, colors, and shapes) 
and responses (shape and size names) are identical with those in the 
experimental list. In addition, the responses, in both lists, are names 
of the corresponding stimulus properties. Any differences in the leam- 
ability of these lists would be hard to attribute to anything but the 
presence of relationships between pairs in the experimen::al list . 21 

Assuming S and E agree on the relevant stimulus dimensions, ^*s 
task in learning the experimental list can be viewed as that of discover- 
ing the principle identifying (I) and response determining (D) attributes 
since 0 is the identity map between D and R. On the other hand, ^ could 
learn the experimental list without noting any relationships between the 
pairs. Only one alternative is available in learning the control list; 
there is no principle involving more than one pair. To the extent that 
relationships between pairs are noted, the experimental list should be 
easier to learn. 

The mediation description of the list contingencies in Figure 2 
leaves much to be desired. The representation of principle learning is 
relatively complex and would have been even more so had we not let *’rs" 
represent the typically made distinction between mediating responses and 
their assumed stimulus properties. No single chain, for example, can 
adequately represent principle learning in which more than one pair is 
involved. 22 The 1-1 pairing between the and Ri (i « 1, ...9 4) does 

21 lt may appear that an appropriate control list could be constructed by 
pairing the experimental stimuli and responses in random fashion. Alas, 
this turns out not to be a critical control. Any differences between the 
groups could then be attributed to pre— experimental associations (in the 
experimental group) between stimulus properties and the corresponding 
responses (shape names) rather than to relationships between pairs. 

22xhe S-R representation proposed is original with tb-£ author as far as 
can be ascertained. 
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not follow from an analysis of the S-R links in the longer three stage 
chain. The chain does not make clear, for example, why is the resporse 
to rather than R2* The more direct two-link chains Involving the rs^^ 

(i » 1, ...» 4) serve this purpose. 

In view of this complexity, perhaps the most crucial limitation may 
prove to be the ina;>ility of S-R formulations to lead one to ask practical- 
ly important questions concerning meaningful learning. The S-R representa- 
tions that would seem to be called for bear more than a passing resemblance 
to pre-Copernlcan epicycles and related attempts to salvage geocentric 

theory. 

With the assistance of Judith Anderson, I (Scandura, 1967) conducted 
a pilot study that is relevant. Its purpose was to determine relationships 
between the number of S-R pairs per principle, in a PA list, and learning 
rate and transfer. 

The materials to be learned consisted of 12 pair lists. Each stimulus 
had a property relating to shape, border, shading, outline, and color. 

Four colors and eight values of each of the other four attributes were 
used. The responses were descriptive labels attached to the non-color 
stimulus properties (e.g., circle). Of the 12 pairs in each list, six 
were instances of one principle (P6) , three were instances of another (P3) , 
two were instances of a third (P2) , and one was an instance of a fourth 
(PI). The principles were constructed so that the same principle applied 
to all stimuli having a particular color. The response determining cue 
was either a shape, a border, a shading or an outline. The four colors 
and the determining attribute dimensions (e.g., shape) were randomly 
paired to form four principles (e.g.. If black, then shape), which appeared 
equally often under each condition. The PA list was learned by the 
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antlclpatlon method to a criterion of three consecutive errorless trials. 

To determine whether the principles were acquired sometime during the 
list learning, each S was shown two transfer lists of four new stimuli 
each, eight in all. Each transfer list Included one stimulus, associated 
with each of the four learning principles. Responding according to one of 
the principles was presumed to indicate that that principle had been 
learned. 

Prior to learning the original list, each of the 20 college Ss was 
pre— trained so that he was familiar with the stimulus dimensions and could 
name each stimulus property. These responses were typed on a card and 
were always available to In addition, ^ was told that a pattern was 
involved which might facilitate his learning and guide his responses to 
the transfer stimuli. 

The dependent variables were the average number of errors per 
Instance (i.e. , an S-R pair associated with a principle) for each S (on 
each of the four principles) and the number of appropriate responses to 
the transfer stimuli. 

Except for a very small reversal between treatments P3 and P2, 
learning rate (i.e., the average number of errors per instance) decreased 
with the number of instances per principle: 5.0, 3.4, 3.5, and 2.7, 

respectively (F « 8.76, df » 3/76, p < .001). The difference between PI 
and P2 was significant (F*-® 11.50, df " 1/76, p < .01) but none of the 
other adjacent means differed significantly. Under the experimental 
conditions, the rate of learning an S-R pair increased with the addition 
of a second S-R Instance but Increasing the number of Instances still 
further apparently had little effect. 

The number of appropriate responses to the transfer stimuli was also 
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affected by the number of instances per principle. There were 27, 8, 1.3, 
and 9 appropriate responses (as indicated by the experimental principles) 
given to the P6, P3, P2, and PI transfer stimuli, respectively. Although 
the trend was not entirely regular, a sign test indicated that the degree 
of principle learning was higher in treatment P6 than in the average of 

treatments P3^ P2, and PI (z = 2.6, p < .005).^^ 

Another analysis demonstrated that P6 transfer was related to learn- 
ing rate. Of those 9 Ss who responded appropriately to both P6 transfer 
stimuli, 7 had j^low median (2.61) error scores, indicating more rapid 
learnings of those 11 Ss who responded appropriately to at most one test 
stimulus, 8 had above median error scores, indicating slower learning. 

An exact probability test (Finney, 1948) on the resulting 2x2 contingency 
table indicated a significant relationship between P6 transfer and learning 
rate (p < .035). The small number of Ss who gave two appropriate responses 
with respect to the other principles precluded the possibility of obtaining 
significant relationships. Only 3, 5, and 2 ^s gave both desired response.*. 

to the P3, P2, and PI test stimuli, respectively. 

The list learning and transfer results were not entirely consistent. 
The inclusion of more than two instances did not affect learning rate, 
but it may have affected transfer. These results could reflect real 
differences or be simply artifacts of the situation. In either case, the 
overall pattern of results was sufficiently clear to make any interpreta- 
tion in terms of stimulus or response generalization extremely difficult, 
if not impossible. Some resort to S-R generalization (Hull, 1943, 

23it might be argued that the difference in the number of appropriate 
responses was due to there being more responses per category in treatment 
P6. When in doubt, the ^s may have tended to give a response from the most 
frequently experienced category. A comparison, however, of the average 
number of P6 responses given to the P3, P2 and PI transfer stimuli (16) was 
not significantly higher than the ten P3, P2, and PI responses given to the 

P6 stimuli (p > .10). 
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Berlyne, 1965) would appear necessary. 

Principle Generality. Learning, and Response Consistency . - Sub j ect 
matter specialists, particularly those in mathematics, are inclined to 
emphasize the importance of teaching general, rather than specific, 
principles. Experimental data, on the other hand. Indicate that the more 
closely learning and test stimuli approximate one another the better the 
test performance. 

Armed with the denotative characterization of a principle, as a set 
of ordered pairs. Woodward, Lee, and I (Scandura e^ al , 1966) set out to 
reconcile opinion with apparently discrediting fact. In particular, we 
were concerned with the. effects of principle generality on learnabillty 
and transfer. In the same study, we also explored the response consistency 
hypothesis with more complex materials. Two experiments were conducted, 
the Independent variable in both cases being the scope of a principle 
statement. Scope was defined in terms of the corresponding denotation, 
one statement being more general than another if the denotation of the 
former included the latter.^^ 

Our original hypotheses were that: (1) the scope of a principle 

would be fully reflected in performance, there would be little success 
with extic scope problems and no differences in performance on within- 
scope problems, (2) the learnabillty of a statement, as determined by 
within-scope performance, would depend on scope, and (3) the combining 
rule taught would be used on all problems under conditions of non- 
reinforcement. 

2Awotlce that defining a denotation as a set, makes it possible to consider 
a variety of other relationships between different denotations. In parti- 
cular, two denotations may be discrete (have no instances in common), over- 
lap, or identical in addition to being ordered (one being more general than 

another) . 
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In the first experiment, each group of 17 college Ss was presented 
with one of three ordered priiKlples dealing wich a number game called 
NIM (Banks, 1964, 55-58). In the game, two players alternately select 
numbers from a specified set of consecutive integers, beginning with one, 
and keep a running sum. The winner is the one who picks the last number 
in a series with a predetermined sum. If this sum is 31 and the set 
consists of the integers 1-6 the players alternatively select any number 
from 1-6 until the cumulative cum is either 31 or above (in which case 
no one wins) . There is a compound rule which allows the player who goes 
first to win any such game, **&ivlde one more than the largest number in 
the set into the desired sum— make the remainder the first choice— on 
subsequent tries, consistently select that number which when added to 
the opponent's preceding choice sums to one more than the largest number 

in the set." 

Each such game could be characterized by an ordered pair of Integers. 
The application of each principle was illustrated with a common (6, 31) 
game. The least general principle (S), adequate for winning only (6, 31) 
games, was stated, "...make 3 your first selection. Then... make selection.! 
so that the sums corresponding to your selections differ by 7. Principle 
(SG) was adequate for solving (6, j) games j = 1, 2, ..., n and was 
stated, "the first selection is determined by dividing the desired sum by 
7. ...Then... make selections so that the sums corresponding to your 

selections differ by 7.=' The most general principle (G) was adequate for 
solving (1, j) games i = 1, 2, ..., m; j * 1, 2, ..., n and was stated, 

'the first selection is determined by dividing the desired sum by one more 
than the largest integer in the set from which the selections- must come. 
...Then... make selections so that the sums corresponding to your selections 
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dlffer by one greater than the largest integer in the set from which the 
selections must come.*' 

All ^s, including two control groups, were tested on three problems. 
The first was within the scope of each principle, the second within the 
scope of all but principle S, and the third only within the scope of 
principle G. 

The results were straightforward. Of those 13 _Ss in group S who 
solved problem one, none solved problem two, and only one solved problem 
three. The corresponding numbers for groups SG and G were, respectively, 
5, 4, 0 and 5, 5, 4. Within the scope of each principle there were only 
chance differences in performance on the problems. On the other hand, 
only one ^ solved an extra-scope problem. 

The relative interpret ability of the three rule statements was 
determined by comparing group performance on problem one which was within 
the scope of each. Rule S proved to be easier to learn, under the self- 
paced conditions, than did the rules SG and G (p < .007 in both cases). 
There was, however, essentially no difference in the interpretability of 

rules SG and G. 

The third facet of this research was concerned with the consistency 
with which presented principles are applied. We wanted to determine 
whether the S and SG ^s would use the rule taught even when it was 
inappropriate (on the second and third problems). To make this possible, 
no information was given as to when the principles were and were not 
appropriate. In effect, the properties belonging to the set, I, were not 
specified— the ”if‘ statement was identical for all three rules. 

Of the 17 S Ss, 13, 9, and 8 used the rule taught on problems one, 
two, and three, respectively. The corresponding numbers in groups SG 



and G were 7 , 7, and 5 and 6, 6, and 6. Although there was a slight 
tendency to not use the rules taught on problems two and/or three, there 
were no significant differences in frequency of use. 

These results certainly provided strong support for our original 
hypotheses: (1) performance on within-scope problems did not differ 

appreciably, even though the common illustration was more similar to 
problem one than th« others, and successful problem solving was limited 
almost exclusively to within-scope problems, (2) rule S proved easier 
to interpret than rules SG and G, and (3) the rules taught tended to be 
used consistently on all problems whether they were appropriate or not. 
The first mentioned result is particularly Interesting since it tends 
to cast doubt on the assumption that there is a generalization gradient 
associated with S-R generalization and therefore provides indirect 
support for a rule interpretation (e.g., Berlyne, 1965, 171-174). 

About the only major unanticipated result in experiment one was that 
rule G proved as easy to interpret as rule SG, In view of the rather low 
proportion of successes in these groups, we were originally tempted to 
attribute the lack of such an effect to scale insensitivity. 

To determine the generality of these findings, a second experiment, 
dealing with arithmetic series, was conducted with junior high school 
Ss, In this experiment, both scope (S, SG, G) and example (present, 
absent) were varied independently. One of the basic ideas, adequate for 
summing arithmetic series, repeated addition, was already familiar to 

25Even if a generalization gradient is eventually demonstrated, S-R 
interpretations will have to consider the possibility that the result 
is simply an artifact resulting from the use of continuous dimensions 
and individual differences in *‘just noticeable differences" (e.g. see 
Lykken, Rose, Luther, & Maley, 1966). To the extent that the variables 
involved in meaningful learning are discrete, a rule interpretation may 
prove more useful. 
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most of the ^8 and it was felt that an illustration of a rule might 
provide a basis for generalization, via discovery, to extra-scope problems. 
Another difference between this experiment and the first was that rula S, 

50 X 50 (b 2500) , was effectively an answer given treatment and applied to 
only one series. It was used both as the common example and as problem 
one. In experiment one, rule S applied to a class of games. 

Although the pattern of results shown on Table 1 paralleled those 
of experiment one in most respects, there were several important differ- 
ences. First, the presence of the example (problem one) along with rule 
S resulted in significantly better performance on problem two than when 



TABLE 1 



Summary^ number of correct sums (uses of rule taught) 
on problems one, two, and three 

Rule Rule and Example 





N 


one 


two 


three 


N 


one 


two 


three 


Group S 


20 


8(8) 


1(1) 


2(1) 


21 


20(20) 


9(0) 


3(0) 


Group SG 20 


5(5) 


4(5) 


0(5) 


15 


11(12) 


8(9) 


1(6) 


Group G 


19 


3(5) 


5(7) 


2(7) 


19 


18(18) 


14(16) 


3(15) 



rule S was shown alone. This was the only case in both experiments where 
non-negligible success was noted on an extra-scope problem. Using the 
analytical tools described above, it is conceivable that this effect was 
due to an important conceptual difference between the rule S statement 
and the others. ”50 x 50” is clearly an instance of the more general 
combining rule, ”n x n = n^.” This was not the case for any of the other 
rules. Presumably, the statement of rule S, together with the common 
illustrative series, 1 + 3 + 5+ ... + 97 + 99, provided the successful 
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S Ss with enough cues to generalize. In particular, they may have dis- 
covered that this series had 50 terms. Hindsight suggests that this 
difficulty could have been overcome by simply stating the sum, 2500, of 
the illustrative series rather than ”50 x 50.“ 

Second, only three of the 19 G-with-example ^s solved problem three 
whereas 18 solved problem one and 14 solved problem two. The decrement ■ 
between problems two and three was significant (p < .004). The reason 
for this difference was not immediately apparent . An intensive post hoc 

I 

analysis of the test papers, however, suggested that the result may have 

been due to a difference in the derivation rule for determining the 

number of terms, W, for use in the combining rule, (A_+JL)m. N could be 

2 

determined from problem series one and two by taking the average of the 
first and last terms. This procedure led to an Incorrect value (25, 
rather than 24) for N in the third series, 2 + 4 + 6+ ... + 46 + 48. 

In short, the difficulty was not in the rule itself but in finding the 
correct value of N. Such difficulties may be circumvented in future 
experimentation by controlling for unwanted differences involving 
identifying properties, D (and/or combining rules, 0). 

Third, the results of experiment two cast further doubt on the 
hypothesis that the interpretability of a principle statement depends 
solely on the generality of its denotation— although it may tend to covary 
with generality. As indicated above, the more general the principle, the 
more abstract the properties and rule used to characterize it. Abstract- 
ness, of course, refers to the hierarchical level of the set properties— 
one property being more abstract than another if it refers to a collection 
which includes, as elements or derivatives thereof, sets to which the less 
abstract properties apply. 




The interpretability of principle statements ("then** clauses in the 

present study) of varying generality may depend on both abstractness and 

the way the properties . in I, D and R and the operation 0 are represented. 

Thus, making operational use of the relatively abstract arithmetic seriea 

property, **the difference between adjacent terms in some common value," 

necessarily presumes that, ‘the difference between adjacent terms is two ," 

. . three .'* "etc., ‘ can all be correctly interpreted. The converse, 

however, does not necessarily follow. Interpretability may also depend on 

the symbols actually used to denote the critical properties (in the sets 

I, D and/or R and/or 0) . Whereas ‘'N'* might suffice for one _S, another 

might require, "the number of terms in the series.** Both have the same 

referent, but the former symbolizes the latter expression more succinctly. 

Similarly, one S may be able to "compute (AjfL)jj** whereas another could 

2 

not, requiring Instead a statement like, "add A to L, then, divide the 
resulting sum by 2, and finally, multiply the quotient so determined by 
N." The latter rule statement simply makes clear the sequence of steps 
and binary operations implied by the algebraic statement. 26 

These observations lead to the following tentative definition of 
description level . Symbolic representation A Is of a higher level than 
B with respect to some reference symbolism (usually the native la nguage^, 
if the translation of A Into the reference symbolism requires all__of 



26why one way of symbolizing a statement is more interpretable than 
another, rather than vice versa, is a difficult question to answer, but 
it probably relates to the order in which symbolizations are learned 
(i.e., the native language f-’rst). Ordinarily, shorter statements are 
substituted for longer ones as their use becomes more frequent. Perhaps 
this is a natural process resulting from man's tendency to recode 
information into a manageable number of chunks (e.g.. Miller, 1956). At 
any rate, it is to be expected that the shorter and simpler the representa 
tlon, the more easily will it be learned and remembered. 
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those r eference symbols needed to translate B plus so me additional 
symbols of Its own * Statements generally can be only partially ordered 
as to description level. 

In addition, since both abstractness and description level may vary 
concurrently, not all statements, referring to the same principle, are 
necessarily comparable as to Interpr etablllty. 

Fourth, only one of the Ss who was shown the rule, 50 x 50, applied 
it to problems two and three. This result can probably be attributed to 
interfering effect due to prior familiarity with addition problems. 
The Ss may simply have mistrusted rule S. How could a rule, like 50 x 
having only one answer, be the sum of all three problem series. Host 
junior high school ^s would find it unreasonable that the series 1 + 3 + 
, . , + 99 (problem one) and 1 + 3 + . . . + 79 (problem two) have the same 



27/^lthough they were not directly involved in the present study, there are 
at least three other ways in which principle statements may differ as to 
interpretability. First, the form of the statement may differ. Thus, 
instead of ”If A, then B,” a principle might read *'B whenever A. Such 
differences could be important, especially at early developmental levels. 

Second, the determining properties may be partially ordered as to 
discemibility. Pr operty A Is said to be more discerni ble than proper^ 

B If the det ermina tion of B rcouires the determina tion of A along^ with 
the determination of othe r properties or the use of one or more addition^ 
combining operations . In this sense, the first term. A, in an arithmetic 
series is more discernible than the number of terms, N. Determining N 
requires the discrimination of A and all of the other series terms plus 
counting . 

Third, combining operations may also differ as to interpretability 
even when their domains and ranges are identical. Rule A is said to be 
more int erpretable than rule B If rule A Is equ ivalent to^ rule B bu^_t^ 
fllmnlificati on of rule B (to produce the desired response) nefiesgarlly. 
InvolveTX The combining rule b^-a^ . for example. Is more interpretable 
b 2 

than the rule ft dt, where the parameters a and b and the determined value 

® ^ 2 9 

are the same in both cases. The simplification of ft dt (* ) 

a 2 

involves the algebraic rule b^-a^ . How broadly applicable this defini- 

2 

tion is is an open question, but it would appear to hold promise as a 
first approximation. 
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sum (50 X 50)* Some such reluctance may also have obtained on problem 
one with group S-wlthout-example« Nonetheless » v/e were surprised that 
only 8 of those 20 ^s, not presented with the Illustrative series » gave 
the correct sum (2500 or 50 x 50) for test-series one. 

The most Important feature of these exploratory experiments was not 
the results, but the post hoc analyses they made possible. These analyses 
have led to what 1 believe Is an Improved formulation of the SFL, in 
general, as well as of the problem of statement interpretablllty, in 
particular. Host important, the present findings attest to the importance 
of strictly subject matter considerations in behavioral research. A 
growing body of research on meaningful learning (e.g., Gagne, 1961, 1962 
Scandura, 1966, 1966, 1966; Roughead & Scandura, 1967) suggests that 
structural consldbratlons are often more crucial than strictly behavioral 
variables such as amount of practice, massing, etc. 
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SFL .Al'IALYSES 

In this section, four problem areas have been selected for SFL 
analysts: (1) exposition and discovery learning* (2) reversal and non- 

reversal shifts, (3) conservation tasks, and (4) syntactic and semantic 
learning* Each of these problems has posed considerable difficulty for 
S-R analysts* To my knowledge, only the reversal-nonreversal shift 
problem has been successfully formulated in S-R terms (e*g*, Kendler & 
Kendler, 1962)* Whether or not an S-R formulation can ultimately be 
derived, however, is not in question* My point is simply that the SFL 
is up to the task and that I doubt that S-R formulations, ev*.n if devised, 
will be equally as helpful in generating fruitful research hypotheses* 

Exposition and Discovery * - Previous studies involving expository 
and discovery modes of instruction have not been entirely consistent, 
even when the experiments, themselves, have apparently been well controlled 
(e*g*, Craig, 1956; Gagne & Brown, 1961; Haslerud & Meyers, 1958; Kersh, 
1958; Wittrock, 1963)* These discrepancies have been due, in no small 
part, to the inadequate specification of underlying constructs* By 
helping to strip expository and discovery contexts of unessentials, SFL 
analyses may suggest important similarities and differences between 
treatments* For the sake of brevity, I shall not attempt comprehensive 
coverage of all possible expository and discovery treatments, but simply 
to give enough of the flavor of what is involved so that other analyses 
may be Inferred* 

Learning by exposition may take several forms, most of which can be 
reduced to Interpreting statements of given principles* The scope of 
such statements (l*e*, the inclusiveness of the corresponding denotations) 
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may differ greatly. In some expository treatments, the responses corre- 
sponding to distinct stimuli are stated directly. In effect, the principles 
involved all have one pair denotations. Such a treatment is typically 
referred to as "answer-given" (e.g., Wittrock, 1963). In at least one 
study (Gagne & Brown, 1961) , the answers given were formulas for finding 
sums of series rather than responses of the sort that have more typically 
been used, namely sums . Such apparent differences have led to confusion 
and have made direct comparisons difficult, if not impossible. Typical 
characteristics of this type of exposition are that the sets I and D, in 
the principle statements, are equal and the combining operation 0, 
effectively maps a single experimental stimulus into a single response. 

In other expository treatments (e.g., Craig, 1956; Haslerud & Meyers, 

1958; Scandura et^, 1965; Wittrock, 1963) more general principles 
(often called rules since the "if" clause is usually left to be inferred) 
have been used. 

The criterion measures, used in conjunction with such treatments, 
have included examples used during learning to illustrate the principles 
(learning test) , performance- on new instances of the principles taught 
(transfer), and petformance on new instances of new principles (nonspe- 
cific transfer). 28 in each case, test performance can be predicted on 
the basis of what has been learned. Performance on the illustrative 
items, of course, simply reflects the degree to which they are learned. 
Similarly, performance on new, but within scope, items reflects the 
degree to which the corresponding principle has been learned (i.e., 
interpreted correctly). Satisfactory performance on new instances of 

28^l^jjough frequently measured, retention is not considered in this 
analysis. 
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new principles is to be expected only if either the principles presented 
provide a basis for generalisation (as the rule "50 x 50" provided a 
basis for generalisation to "n x n") or other appropriate, but usually 

unidentified, ssSi onerandl have been previously acquired. 

Discovery learning typically involves presenting stimuli or instances 
of principles and calling attention to critical stimulus cues, combining 
rules, and/or relationships between the stimuli and responses. In most 
experimental studies involving discovery learning, the learner is shown 
one stimulus at a time and is given varying amounts and kinds of direction ■ 
as to how he might determine the correct response. The learner may be 
given hints which direct his attention, provide analogies, etc., as to 
what the critical (stimulus) properties, in sets I and D. are and/or how 
those in D may be combined to determine R. The more information given, 
presumably, the more likely is discovery to occur. Typically, a good 
deal of cue selection and combination is left to the learner. 

in comparing learning by exposition and discovery, some Investigators 
(e.g., Craig, 1956; Haslerud 6 Meyers, 1958; Wlttrock, 1963) have been 
concerned solely with the manlpulable aspects of the stimulus situation. 

The occasion for reception or discovery learning is provided first and, 
then, followed by a test(s) for transfer. The results of such testing, 
however, are typically biased by uncontrolled differences in original 
learning. Since the relative ease of learning by exposition and discovery 
undoubtedly depends on the mat erial in question, this approach can hardly 

29A„other fom o'fVSL^i^tl^ 

to the stimulus member of a new instance indicatr . 

learner is required to ®**®*'“*,* properties in I and D and., 

earlier, this process involves ® rule*^for going from D to R. 

in turn, discovering the common derivation rule for going rr 
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be expected to achieve definitive results insofar as transfer is conceriiv.d. 

Other investigators (e.g., Gagne & Brown, 1961) have been careful to 
equate original learning before testing for transfer, but they are open to 
criticism for not making explicit the relationships between their treatment 
and criterion measures. In particular, the relative advantages of reception 
and discovery learning depend on what is presented, on what criterion 
measure the groups are equated, and on what measure they are compared.^® 

Since the study was well designed, apparently definitive, and con- 
trolled for original learning, an analysis of the Gagne and Brown (1961) 
study is particularly instructive, in that study, the £s were presented 
with number series, such as 1 + 2 + 4 + 8 + 16 + 32 + ..•, and were either 
given or required to discover algebraic formulas, involving the number of 
terms, n, for summing any series beginning with the specified terms and 
continuing with the same pattern* After completing a preliminary program, 
designed to acquaint the ^s with the concept of a number series and a 
number of terms relating to such series, three treatments were given. C^e 
group (RE) was presented, in turn, with the correct formulas, another group 
(GD) was given indirect guidance as to how to find such formulas, and a 
third group (D) was instructed to find formulas and was given hints as to 
what the formulas were like. 

Since the error rate on all programs was reasonably low, it may be 
assumed that the three groups learned the desired rules to about the same 
degree. That is, each ^ who completed one of the programs was probably 
able to write the correct formula associated with erch series presented. 

The required learning was simply a set of discrete one instance principles. 




^^eaming ability is often cited, by pedagogical enthusiasts, as a major 
advantage of discovery methods of instruction. 
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The combining operations, 0, learned by the R & E group, were probably 
simple associations between the properties in D and those in R, leaving 
little opportunity for abstracting a general rule for determining 
formulas. The GD group, and to a somewhat lesser extent the D group, 
had an opportunity to discover a stimulus proc-issing rule.; a rule which 
may have made it possible for them to determine the formulas correspond- 
ing to the test, as well as the training, series. 

One of my doctoral students, William Roughead, and I carefully 
analyzed the training and test series as well as the discovery treatments 
themselves and found that this was the case. The technique by which the 
discovery Ss were led to discover the training formulas was also applicable 
to the test series. There was, in effect, a common principle relating to 
both the training and test series, but only the discovery groups had an 
opportunity to discover it. 

As might be expected on the basis of this analysis, the results 
obtained by Gagne & firown (1961) were clear cut. The test performance of 
the discovery groups was reliably superior to that of the exposition group. 
Gagne and Brown (1961) originally indicated that it might be possible to 
guess what information would be helpful to the discoverer, but they could 
not specify exactly why this information enhanced discovery. The present 
discussion suggests that such answers can be obtained from SFL analyses. 
The relationships become explicit in terms of the principles, statements, 
and denotations involved. More Important, there appears to be no reason 
to suspect that SFL analyses will prove any less useful in formulating 
future research in this area.^^ 

31william Roughead and I have recently completed a study in which "what 
is learned' in discovery learning was taught by exposition and in which 
the order of giving answers and providing an opportunity for discovery was 
varied. The preliminary results appear very interesting and they will be 
reported in the near future. 

I 
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Reversal &ncl Nonrcvcrssl Shifts * ^ Figure 3 characterizes reversal 
and nonreversal shifts. The objects shown vary on two dimensions, size 
(large, small) and color (black, white). These objects are shown in pairs 
and S is required to choose the correct alternative, indicated by + in 

Fig. 3. 

The experimental paradigm involves learning to make two discrimina- 
tions, the second after S can reliably make the first. The first discrimi- 
nation is identical for both groups; the second depends on the treatment, 
reversal or nonreversal. On the first discrimination, in the example shown 
in Fig. 3, size is the relevant dimension, large being positive. Color is 
irrelevant. A reversal shift involves the «®e«d4ttonsion, size, out the 
correct response becomes small. A nonreversal shift involves the color 
dimension, black being correct. 

First Discrimination Second Discrimination 

Small Postive 





Fig. 3. Example of a reversal and a nonreversal shift. 
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Preverbal children (Kendler, Kendler & Wells, 1960) and animals 
(Kellehcr, 1956) find nonreversal shifts easier to make than reversal 
shifts whereas older, more verbal, ^s find reversal shifts easier (Buss, 
1956; Harrow & Friedman, 1958, Isaacs & Duncan, 1962; Kendler & D*Amato, 
1955). In a study with kindergarten's, Kendler and Kendler (1959) found 
that fast learners, like verbal Ss, were better able to make a reversal 
shift whereas slow learners, like preverbal £s, were better able to make 
a nonreversal shift. It was suggested that the fast ^s approached the 
experimental task with verbal labels for the correct stimulus already 
strongly attached, the verbal label serving as a mediating link in a 
two-stage S-R paradigm. The learning of the slow and presumably preverbal 
Ss was assumed to involve a single-stage paradigm. Kendler and Kendler 
(1962) explained the relative ease of reversal and nonreversal shifts in 
terms of the number of S-R associations that need to be changed. 

This interpretation, however, makes no provision for answering the 
question of whether the increasing ease of making reversal shifts with 
age is due to a gradual increase in verbal ability by all ^s or to some 
specific characteristic had by a larger proportion of faster learning 
children. Reformulating the reversal-nonreversal problem in the SFL 
provides a basis for answering this question. 

Since this problem is different, in an Important sense, from any 
previously encountered in this paper, some preliminary observations are 
in order. For one thing, the stimuli consist of pairs of objects. In 
addition, the critical stimulus (set) properties are relationships 
between objects.^^ Learning to make the first discrimination shown in 

32stimuli having a common relationship need not have any ’’physical’’ 
property in common. 
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Fig. 3» for example 9 may be equivalent to learning a principle in which 
► I • D ■ [one object larger than other], R “ [larger object], and 0 is a 

rule which maps the relational property in D onto such a discrimina- 

tion can also be accomplished by learning two or more less general 
principles. Notice that during the first discrimination task, S is never 
presented with two large or two small objects. Thus, S could learn to 
always choose the large black object when it appears and the large white 
object when it appears. The fact that both are larger than the object 
with which they are shown might, for ei^ample, go unnoticed with young 
children. One such less general principle might be stated, "If shown 
two objects, one of which is large and black, then the response is 
determined by choosing the large black object.''^^ 

Learning a single relational principle on the first discrimination 
should result in reversal shifts being easier than nonreversal shifts. 

In a reversal shift the critical properties would remain identical, only 
the operation, 0, would need to be changed— pick the smaller, rather than 
; * the larger, object. A nonreversal shift would Involve learning either 

a completely new general principle or two new more specific principles, 
in which both the critical cues (involving color) and operation need to 
j be identified. On the other hand, if two less general principles are 

learned on the first discrimination, a reversal shift would involve 
learning two new subprinciples (or one more general principle) whereas a 

principle of this sort is analogous to what Bruner, Goodnow, and Austin 
(1956) have called a relational concept. Although not treated here, 
is reason to believe that transposition phenomena (cf. Hebert & Krantz, 1965) 
may also be reformulated in terms of relational concepts or principles. 

3^It should be emphasized that S need not be aware of verbal labels in 
order to learn a principle.- The statements in the text serve primarily a 
definitional function. 

r 
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nonreversal shift would involve learning only one. The principle, indicated 
by the statement, "If two objects, one of which is Icrge and black, then 
choose the large black object," is equally applicable to the original 
discrimination task and the nonreversal shift task in which black is posi- 
tive (see Fig. 3). In short, the relative ease of shift may be dependent 
on "what is learned" on the first discrimination. 

Of course, this interpretation is analogous to that presented by S-R 
theorists (e.g., Goss, 1961); Kendler & Kendler, 1962). It is the implied 
assessment methodology which provides the means for determining whether 
‘what is learned" is related to relative ease of shift. In order to 
determine ^s basis for making the first discrimination, it is necessary to 
employ dimensions, such as color and shape, which have more than two 
easily discriminated values. This procedure would make it possible to use 
two values of each dimension in presenting the first discrimination 
problem leaving the other values for assessment purposes. Suppose, for 
example, the four objects used in training are either black or shaded 
a circle or a square. Suppose, further, that color is the critical 
dimension on the first discrimination. Then, the assessment procedure 
might involve a new discrimination problem in which the four objects used 
are descrlbable in terms of the two colors (black, shaded) used during 
training, and a shape (e.g., triangle) not so used. 

In order to help minimize "strategy shifts," between learning and 
assessment, positive reinforcement might be given at each choice point, no 
matter what the response. Under these and appropriate instructional 
conditions, choosing, as positive, the object having the same color as 
the positive object, used during training, would be indicative of learning 
a general principle (concept) on the training task were it not for the 
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high probability (1/2) of choosing this object by chance alone. Assess- 
ment "certainty/* of course, might be increased by using more test 
discrimination tasks. This would be made possible by increasing the number 
of values per dimension. 

Unfortunately, there is no relevant data. Because of the obvious 
Implications for future research in this area, earlier verification of 
this analysis is urged. 

Conservation ve rsus Non-Conservation . - Questions relating to the 
conservation (i.e., invariance) of such properties as amount and number 
(e.g., see Flavell, 1963), comprise another problem area in cognitive 

development that may be reformulated in SFL terms. 

Consider a procedure that is frequently used to determine whether a 
child has learned to conserve amount. E shows the child two balls of 
clay, both of the same size as in display one of Fig. 4. Indeed, E may 
let S make them the same size. £ is then asked whether each ball, in 
turn, contains more clay than the other or whether they contain the same 
amount. Invariably the normal child says they are the same. Next, E 
rolls one of the balls into the shape of a sausage as shown in display two 

^^It is possible to devise reversal-nonreversal type tasks involving more 
general principles and/or principles not involving relational stimnius 
properties. Under such conditions, assessment problems can be minimized. 

^^Since this manuscript was written, Tighe (1965) has published a paper 
in which she demonstrated that the relative ease of making reversal and 
nonreversal shifts with 5 and 6 yr. old children can be manipulated by 
prior training designed to emphasize the independence and dimensional 
nature of the properties of stimuli used in subsequent discrimination 
shifts. These results provide support for the current interpretation in 
that learning a general principle on the first discrimination requires 
that S recognize the independence of the two object dimensions. With 
the more restrictive principles only object, and not dimension, differ- 
ences are Important. 
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of Fig, 4 and, then, asks the same question. If the child says that they 
are the same, and he does so consistently on this and other tasks, he is 
said tf> conserve amount. If not, he is a nonconserver . 




Fig. 4. Two displays designed to determine the conservation of amount. 

As with reversal and nonreversal shifts, the stimuli are object 
pairs and the critical properties are relational. 



When formulated in the SFL, E is led to ask not merely whether £ is 
or is not a conserver, but on what basis ^ Is responding. The objects 
shown in the Fig. 4 displays are related in many ways— relative volume, 
weight, height, length, width, shape— besides relative amount. Any one o.: 
these (not necessarily independent) relational properties could provide a 
basis for responding. In display one, for example, the two objects contain 
the same amount and have the same length (as well as just about everything 
else except position) . Thus, a correct response could signify the 
operation of either of the two principles, ”If two clay objects, then the 
response depends on the relative amount,^' or *'If ...relative length." A 
child's reaction to display two may make it possible to determine which of 
these two principles is operating. Thus, if the child says that the 
sausage (Display Two) contains more clay, the operating principle probably 
involved length. If he says "the same," amount was likely the determining 
f actor. 




37it is worth noting that both of these two responses eliminate height from 
consideration as a basis for responding. Saying that the sausage contains 
more clay, however, does not eliminate shape . 
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The form of analysis, described above, is quite similar to that 
applied to reversal and nonreversal shifts* Both involve assessing what 
the child has previously learned. The essential difference is that, with 
reversal and nonreversal shifts, the relevant prior experiences (on- the 
first discrimination) can be specified. In developmental situations of 
the sort described, conservation of amount is assumed to have been 
acquired or not acquired prior to the experiment. If ^ does not demon- 
strate conservation, it is presumably not a question of choice but of 
necessity. The principle of length may be operating, rather than that 
of amount, not because ^ prefers it, but because he has not acquired the 
concept of amount. A child, of course, may acquire both the concepts of 
amount and length in a generic, nonverbal, sense (e.g., Braine, 1959) 
before he knows what the words ’’amount” and ’’length” actually mean. All 
that can be said with '"•onfidence is that the phrase, ”whi.ch contains 
more clay,” may be interpreted by a child in any one of several alterna- 
tive ways. 

S yntactic and Semantic Learning . - ^11 of the stimuli, as well as 
principle statements, considered so far are symbolic representations of 
an abstraction. A stimulus such as ”1 + 3 + 5 + 7,” for example, 
symbolizes an abstraction reflecting the structure of a variety of more 
concrete stimulus situations— e.g. , four stacks of pennies, the first 
containing one penny, the second three, the third five, and the fourth 
seven; a figure representing the produce of four countries, .*., etc. 

Suppose a young child has keen taught a principle which makes it 
possible to write ”16” when shown ”1 + 3 + 5 + 7.” What happens when he 
is presented with the four stacks of pennies and is asked how many there 
are? The answer to this question undoubtedly depends, to some extent, 
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on the significance to ^ of the number symbols (i.e., numerals) in the 
symbolic stimulus* If the numerals refer to properties of collections of 
sets* each including a common number of elements or objects, positive 
transfer may be expected* If, on the other hand, the numerals and the 
arithmetic operations, such as addition, have been learned entirely with- 
out referents, say with flash cards, one could feel fairly certain that 
^ would see no relationship* 

Fortunately, this question can be formulated precisely by characteriz- 
ing the principles involved in the SFL* Assume that the principle (there 
are other possibilities), corresponding to the symbolic stimulus, has been 
determined, by assessment procedures, to be (I ® D ® [number in first 
position, ***, *** fourth position], 0 = repeated addition, R = sums)* 

The requisite for applying this principle, once learned, in a concrete 
situation is precisely that principle which makes it possible to go from 
the concrete situation to the corresponding numbers* A composite principle, 
consisting of this principle, along with that corresponding to the 
symbolic stimulus above, might be characterized (I *» D = [number of 
objects in first position, **., ***fourth position], 0 = determine the 
number of o bjects in each pile (position) by counting the objects, perform 
repeated addition, and note that the sum denotes the total number of 
objects, R = total number of objects) * Learning the symbolic (former) 
principle, without being able to recognize its concrete referents, would 
be like having an egg shell, but no egg* 

The relatively simple hierarchical SFL analysis proposed provides, 

I feel, but a prelude to the insights which may eventuate from similar 
analyses in other situations* Even partial clarification of the relative 
roles of symbolism and concrete referents in meaningful learning and 



performance could have Important practical as well as theoretical 
implications and is long overdue, Although perhaps most directly applic- 
able to mathematics, this sort of hierarchical analysis may also prove 
useful with other subject matters. 



THiiiORriTICAL COMMENT 



This paper deals more with a precise scientific language (SFL) than 
with theory. Although certain problems have been partially clarified, any 
^[predictions resulting therefrom would be based not so much on new theoret- 
ical assumptions as on logical analyses of the situations involved. 

Nonetheless, the Identification of the principle as the basic 
behavioral unit, is bound to have Important theoretical implications. A 
recurring theme of this paper is that the association is too restrictive 
a unit on which to build a theory of meaningful learning. Although close 
relationships have been shown between the association and the principle, 
each being derivable from the other, the representation of increasingly 
complex learning situations, in terms of associations, soon becomes 
prohibitively cumbersome. The principle, on the other hand, appears up 
to the task. Equally Important, the principle not only has much in common 
with a number of proposed basic constructs, such as rules, schemas, and 
TCTE units, but the SFL representation of a principle provides cognitive 
theorists with a much needed ingredient— precision. 

Although it is beyond the scope of this paper to present anything 
approaching a finely-spun theory, I will try to indicate two of the ways 
in which theory development, based on the SFL, might proceed. In the 
process, I hope to suggest some fundamental differences between statistical 
theories, designed to predict group behavior, and more deterministic 
theories, which make the prediction of individual behavior possible. 

The first approach is well exemplified by, but certainly not limltec 
to, stochastic learning models. In these theories, given an initial state, 
each S is typically assumed on each trial to enter the next state with a 



certain probability. This process continues until the terminal or 
absorbing state is reached. To make predictions, based on such theories, 
the values of the transition probabilities, which presumably are based on 
underlying, physiological capacities, are estimated from data acquired in 
situations which are closely related to, and usually the same as, those 
in which the predictions are mad<‘. The predictions, themselves, deal 
exclusively with parameters of the resulting distributions of scores, 
short , assumptions are made about individual learning processes and pre- 
dictions are made about group behavior. 

In order to see how such a theory might treat principle learning, 
consider a situation in which instances of a principle (i.e.f related S-R 
pairs) are presented until ^ can reliably anticipate new responses to 
new, within scope, stimuli. To discover the underlying principle, S must 

(1) determine the relevant properties of the stimuli and responses and 

(2) discover the rule relating them. In addition, if noninstances are 
Included in the test list, S would have to identify those stimulus 
properties which make it possible to discriminate between instances and 

noninstances. 

These requirements imply a model extension of the sort s”s:gested by 
Haygood and Bourne (1965). These authors postulated the need for a second 
process, rule selection, which is independent of attribute selection— th^' 
latter providing the basis for Restle*s (1962) theory of concept learning 
(i.e., attribute identification). In the present, more general, situation, 
stimulus dimensions, rather than properties, would also need to be taken 
into account. The identification of th.^ relevant response dimension would 
also be involved. Only under these conditions could principle learning 
be expected. Notice, in particular, that varying stimuli, although 
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sufflclent for concept learning, would not necessarily Increase the 
likelihood of discovering a principle from Its Instances. The responses 
would also have to vary. These considerations Imply a model with at 
least three, and possibly more. Independent processes. 

A major limitation of present-day stochastic models is that they 
are fundamentally Incapable of predicting individual performance. All £s 
are assumed to bring with them the same learning parameters (l.e., transi- 
tion probabilities); yet, it is common knowledge that the ^s enter these 
situations with Important experiential differences.^^ Such differences, 
of course, could be taken into account so that different groups of ^s are 
assigned different transition probabilities. In the limiting case, indi- 
vidual parameters could be based on individual data. But, then, no 
rational theorist would resort to such nonsense— using an individual's 
data to predict the individual's data. 

Ity reason for this discussion has not been to discredit stochastic 
models. Predicting group performance is equally as defensible, and 
Important, as predicting individual performance. What 1 have intended is 
to provide a perspective for viewing present theoretical concerns. 

Contrast the foregoing approach with a theory which says, in effect j 
that if the relevant things about ^'s present state are known, it would be 
possible to predict what he would do in any given situation. Such a 
theory would necessarily be concerned with determining what the relevant 

^^There are, of course, situations in which organlsmic differences may be 
minimal. The backgrounds of planaria and new-born babies undoubtedly 
differ less than that of normal adults. Similarly, humans are more likely 
to flutter their eyelids in the same way than they are to solve mathematlca' 
problems. Under such conditions, largely ignoring the effects of prior 
learning has caused relatively few difficulties. 
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state characteristics are and with procedures for determining whether they 
are within £*s immediate repertoire. Rather than making assumptions about 
underlying physiological capacities (e.g«, values of transition probabilities) 
to make preaictions about learning, the proposed approach would involve 
inferring behavioral capacities in one situation to predict behavior in 
another. It would be a measurement based theory, unlike that exposed by .a 
present day neoassociationism, in which internal factors, determined from 
past behavior, would play the central role. Such a theory would need to 
specify how inputs interact with the state of an organism to produce 
responses. Scandura (1966) has argued that a theory of this sort would be. 
to a large extent, independent of traditional learning theory. 

The writing of many theorists (e.g., Kuller, 1913; Hull, 1935; Berlyne, 
1965) indicates that there are at least two aspects of state that may affect 
what response is given to a presented stimulus: (1) motivation and (2) 

prior knowledge. Although I am not prepared to defend the idea here, 
motivation might conceivably be viewed as a higher order principle of some 
sort which determines what unit of knowledge (l.e., principle) is to be 
applied. The principle, in turn, uniquely determines what the response 
is to be. VJithout some such mechanism, organisms would be incapable of 
responding in any reasonable fashion. Having been presented with an 
arithmetic series written on a piece of paper, ^wouldn't know whether 
to count the number of terms , give the sum, or burn it (the paper) . 

Although the values of the state variables would necessarily have to 
be infer- 'ed from behavior, all empirical evidence would not have to be iu 
the form of R-R relationships. State variables can be manipulated by 
prior training. Principle learning, for example, can presumably be 
manipulated, at least within certain populations, by presenting suitable 
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statementj directly or by discovery. Assessment procedures would play 
an essential role in such manipulations by Insuring that the Intended 
learning did, in fact, take place. Directions would appear to be the 
manlpulable counterpart of Internal motivation. In effect, a stage 
approach to research of this type could provide a basis for establishing 
causal relationships.^^ 

Although clear ].y speculative, I would be remiss if I did not state 
explicitly what to date have been my working assumptions. First, regard- 
less of how principles are acquired in the first place, behavior, depends 
on the selection.' and use of some* principle. Second, motivational factors 
may be viewed as providing a context within which principles are selected. 
For example, the statement, "If number series, then sum," would serve to 
limit the class of potentially applicable principles by specifying both 
the relevant stimuli and the desired objective. Within this context, ^*s 
selection of a principle would depend on the stimulus properties previously 
denoted by I. Third, principles are assumed to continue in operation so 
long as the then present motivational state obtains. According to this 
postulate, responses to new stimuli remain under the control of a particular 
principle unless new input or feedback otherwise indicates that the rules 
have changed. A tentative fourth postulate is that motivational states 
may be subject to control by higher order motivational states. 

Clearly, these speculative postulates do not, in themselves, consti- 

^^This is not to imply that a stage approach has not been used before. 

Much of the research in the Kullian tradition, involving drive and habit 
strength, would fit this general paradigm. The research of many investi- 
gators (e.g., Hilgard, Irvine, 6 Whipple, 1953; Katona, 1940; Maltzman, 
Eisman, Brooks & Smith, 1956), who have been primarily concerned with 
meaningful learning, also has been conducted in stages. More recently, 
Gagne (1964) has argued pervasively for a stage approach to problem 
solving research. 



tute the basis for any theory. At a minlmums clearer specification is 
needed as to how new principles are brought into play and how existing 
knowledge is modified. Some speculations can also be made as to the 
mechanisms underlying such changes. Perhaps the simplest assumption is 
that a principle ceases to operate when it is no longer applicable— as 
when the stimulus input does not correspond to an Instance of that 
principle. There are at least two ways in which such a situation may arise. 
The environmental situation may change as the result of the responses 
controlled by the operating principle or the change may be the result of 
extra~subject influences. Suppose j for example , that an ^ is in the 
process of nailing planking for tlie floor of a house. The operating 
principle is ‘If (the nail is) up, then hit (with hammer).” The very 
act of hitting causes the stimulus situation to change. This new stimulus, 
in turn, could serve to ”call up“ the next principle— e.g. , ‘If down, move 
to the next nail,' or ”If down and plank secure, get a new plank.” The 
introduction of extra scope stimuli or conflicting directions could also 
transfer response control to a new principle. It would be a brave (or 
stupid) individual, indeed, who did not cease hammering and run as the 
result of someone's shouting, “Watch out for the rattlesnake I” 

Cognitive disequilibrium theories (e.g., see Piaget as reported in 
Flavell, 1963; Berlyne, 1965) suggest that a learned principle may be 
modified if the response dictated by one principle is made to conflict 
ylth that otherwise indicated. Suppose the learned and operating prlnclpl^i 
can be stated, ”lf furry animal with four legs, then kitty,“ and S is shown 
a furry animal with four legs which happens to be a squirrel. The dictated 
response would be ”kltty.” If, however, feedback indicates that the 
correct response is "squirrel,” ^ is confronted with a dilemma. He can 
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el ther retain his principle, (almost) In Its original form, and remember 
the exception (as a new one-instance principle) or revise the original 
principle and assimilate the exception as an Instance of another (perhaps 

I 

new) general principle. 

The theorist might attempt to reconcile the outcome of such conflict 
In one of at least two ways. He might. If he has a physiological bent, 
for example, postulate that the amount of cognitive strain (e.g., Bruner, 
Goodnow, & Austin, 1936; Miller, 1956), Is crucial. If committed to a 
state theory, on the other hand, the theorist would presumably resort to 
higher order principles of some sort. 

CCNCLUDING REMARKS 

The role of any scientific language Is to capture the essence of an 
area— to provide a means for dealing with the relevant variables while 
making It possible to avoid Irrelevancles. Stated In more operational 
terms, a language Is useful to the extent that It leads the research worker 
to ask fundamental questions. If It Is precise, so much the better. 

In the earlier sections of this paper. It was argued that the SFL 
meets both of these criteria. The SFL was shown to be significantly broad 
to encompass a wide variety of behavioral phenomena, from rote to meaning- 
ful learning. The language Is precise, can be formulated In mathematical 
terminology whenever necessary or desirable, and provides a basis for 
relating Internal events to observables. In particular, the SFL, coupled 
with the response consistency hypothesis, makes It possible to consider 
the vexing problem of 'what Is learned." After assumptions are made 
regarding the underlying stimulus values and dimensions, learning can be 
defined In terms of observable test stimuli and responses. An assessment 




4 



- 66 - 



methodology of this sort may provide the necessary basis for the increas- 
ingly called for multi-stage approach to meaningful learning* 

Perhaps most Important, taking the principle as the basic behavioral 
unit makes it possible to deal effectively with actual subject matters and 
leads one to ask new and presumably important questions. Both the pilot 
research, that helped shape our thinking, and the analyses, described above, 
are indicative of the kinds of research suggested by the SFL. Generality, 
abstractness, prior learning, statement interpretability, and discovery- 
all matters which have long plagued thinking pedagogs as well as psychol- 
ogists— have lent themselves to rather close scrutiny. In short, the SFL 
provides a rigorous basis for studying many, if not all, forms of meaning" 
ful reception and discovery learning and their interrelationships. Future 
research may help unlock many of the behavioral mysteries that have 
traditionally been attached to these ideas* 

Nonetheless, the SFL is far from a finished product. Many things 
remain to be done. First, and foremost, the validity of the proposed 
analyses needs to be ascertained. The already completed SFL based 
research (e.g., Scandura et al. 1965) was based on a preliminary and 
largely Inadequate version of the SFL. Theoretical implications need to 
be more carefully and completely drawn out and suitable empirical tests 

conducted. / 

Second, additional modifications, extensions, and Implications of 
the SFL need to be considered. I am, for example, convinced of the desir- 
ability and feasibility of extending the SFL so as to formulate questions 
concerning programed learning, particularly as regards providing a rigorovts 
basis for such technologies as task analysis (e.g., Gagne, 1962). There 
also is reason to believe that the cognitive processes, classified in 




Bloom's (1956) taxonomy, may find precise behavioral counterparts in the 
SFL. i'laking these relationships explicit could have important implica- 
tions for measurement. 

Third, more rigorous attempts should be made to reformulate S-R 
mediation based phenomena and constructs, such as stimulus and stimulus- 
response generalization, discrimination, and classical conditioning, in 
SFL terms. Although this might not always prove to be possible, important 
insights into both languages may be so attained. 

In this regard, I might add that I am under no delusions that the 
SFL can deal effectively with all that the older S-R language can. This 
need not be a crucial limitation, however, since their primary areas of 
concern are different (Scandura, 1966). It may be no more necessary for 
the SFL to deal effectively with all S-R phenomena any more than helio- 
centric theory needed to deal with epicycles. Many of the phenomena 
dealt with within a given theoretical framework disappear when looked at 
from a different point of view. Nonetheless, although apparently lacking 
at present, the S-R language may again prove sufficiently flexible to deal 
effectively with meaningful learning. This possibility must certainly be 

entertained. 

Fourth, the question of memory has not been considered in more than 
a superficial way. If what can be learned in a given situation • 
depend 5 on certain physiological limitations, as has been suggested, 
then what happens to that information which is stored. I (Scandura, 1967, 
Scandura & Roughead, 1967) have suggested that what is stored isn't 
•’forgotten." It may simply get mixed up with what is already there. 

Prior or subsequent activity and learning could result in the storage or 
competing information in the memory locations or in making it hard to 
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relocate this location at recall. Whether such ideas can be formulated 
in SFL terms or not is a completely open, but apparently very complex, 
question. 

Finally, more explicit consideration needs to be given to the 
relationships between the SFL, its related postulates, and existing 
cognitive theories. Since the notion of conceptual confl4ct is so funda- 
mental to many cognitive theories (e.g., see Berlyne, 1965), it might 
provide u good place to start. 

At this point, no one can argue with any authority as to the ultimate 
value of the SFL. The outcome is even less certain as regards theory 
construction, nonetheless, the SFL appears sufficiently promisin ;5 to 
warrant further investigation. 
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THE BASIC UNIT IN MEANINGFUL LEAKNING^““ 

ASSCCIATICH OR PRINCIPLE? 
(Set-Function Language) 



Theoretical development in educational psychology has been extremely slow. 

One major reason has been the typically imprecise definition of independent and 
dependent variables in research on meaningful learning and teaching. Such 
research can bear only an ambiguous relationship to theory, Simixarities and 
essential differences often go undetected. As McDonald has put it, ”^ceE^^ 
clarity means (a) specification, stated in terms as nearly operational as 
possible, of the behavior involved in a task or method; some delineation of 
the range of phenomena included and excluded; and (c) precise description of the 

appropriate tests," 

Stating research objectives and defining variables -in unambiguous terms, 
however, is not sufficient. The teaching- learning process has all too frequently 
been studied in terms of administrative variables, such as class size, grade 
level, and teaching experience. The variables chosen must have broad ejqplanatory 
potential, not be merely symtomatic of and inextricably related to the question 
at haj^d. Theory development depenoa on much more than mere fact finding. 

To provide a substantive base for their research, educational psychologists 
have frequently resorted to the languages, paradigms, and theories of the motner 
science of psychology. Mediations! elaborations and operant conditioning para- 
digms of the stimulus-response (S-R) language and more general, but less well- 

specified, cognitive theories have been popular. 

Each approach has important limitations. From one point of view, poxnimony 
suggests that the properties of overt S-R associations should also be attributed 
to mediational linlcs. Yet, practice has shovm that mediational interpretations 
become increasingly cumbersome and less precise as situations become more 
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complex. Similar difficulties have plasued researchers who have used 
operant techniques to study meanlnsful verbal learning. The results simply are 
no where near so clear In complex human learning as they are in the ■•Skinner Box." 
It is increasingly recognized, for example, that knowledge of results is not 
directly analogous to feeding a pigeon and that, in any case, other factors, 
as subject matter structure, are probably of greater importance in promot g 
efficient learning. A general limitation of cognitive theories is their relative 
imprecision. Typically, "cognitions" are either not clearly specified in 

observable terms or are only partially defined. 

In short, the choice to date has been between a precise, but seemingly 
inappropriate, S-R language and presumably more relevant cognitive formulations 
which leave much to be desired insofar as scientific cohesiveness and rigor are 



concerned. 



Ihe putpose of this paper is to introduce what I feel are the basic ingre- 
dients for a new scientific language for formulating research questions on 
meaningful learning. This so-called Set-Function language (SI'L) is precise and 
seems particularly well suited for dealing with mathematics, my own area of 
concern, and science, but it undoubtedly can be used with other subject matters 
as well. Rather than try to detail the SFL or to sunmarize the related research 
that we have completed or have under way, let me simply try to convey the general 
idea. In the process. SFL and S-R formulations of several meaningful learning 

tasks trill be contrasted. 

The SFL is behavioristic, as is the S-R language, but, unlike the S-R 
language, the SFL denies the primacy of the S-R assocletioi n To illustrate 
some of the advantages of the SFL. consider the situation depicted in Figure 1. 
Suppose an experimental subject is required to learn to say the appropriate 
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word when shown a learning stimulus- -for example » to say "black*' when shown 

the large black triangle. After the original four S-R pairs are mastered 

so that the subject can reliably give the correct response to each stimulus, 

the Question remains as to just what was learned . Did the slibject learn four 

M 

distinct pairs— four discrete associations— and notice no relationships between 
them? Or, did he learn the two principles, "If triangle, then color, and 
"if. circle, then siAe"? 

This question first began to bother me during the summer of 1962, In a 

3 

study designed by Greeno we found, in a verbal concept learning situation, that 
essentially a subject either gives the correct response the first time he sees a 
transfer stimulus or the transfer item is learned in the sa«6*w as its control. 

The thought later occurred to me that if transfer obtains on the first 
trial (if at all), then responses to additional transfer items, at least under 
certain conditions, should be contingent on the response given to the first 
transfer stimulus. In effect, a first transfer stimulus could serve as a test 
to determine what had been learned during the original learning, thereby making 
it possible to predict what response a subject would give to a second transfer 
stimulus. To test this assumption, I had a total of about fifteen (highly 
educated) subjects .overlearn the list shown in Figure 1. Prior to learning the 
list, both the subjects and the experimenter agreed on the relevant dimensions and 
values— size (large- small), color <black<*»^bite) , and shape (circle- triangle). 

The subjects were told to learn the pairs as efficiently as they could, since 
this might make it possible for them to respond appropriately to the transfer 
stimuli. After learning, the test 1 stimuli were presented, and the subjects 
were instructed to respond on the basis of what they had just learned. They were 
told they were correct no matter what the response. Then, the test 2 stimuli were 
presented in the same manner. 

The results were clear-cut. All but three of these subjects gave the response. 
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"black” and "large," respectively, to the test 1 stimuli, and also responded with 
"white and "small" to the test 2 stimuli. It would appear that when a subject 
thinks he i'l right and the new situation remains relevant, he will continue to 

respond in a similar manner. 

On what basis could this happen? It was surely not a simple case of stimulus 
generalization; the responses were distinct and, in any case, did not aepend 
solely on common stimulus properties. The first tes" 1 stimulus, for example, 
is as much like the fourth learning stimulus as the first- -and yet, "black was 
invariably given as the response tather than "small," Perhaps the simplest 
interpretation of the obtained results is that most of '±e subjects discovered 
the two underlying principles while learning the original list, and later 
applied them to the test stimuli. In effect, the relationships between the S-R 
pairs themselves, combined with a response consistency hypothesis, provided a 
basis for assessing what was learned . 

Before introducing the SFL, let us ask how the S-R theorist might represent 
what was learned in the preceding exercise* In S-R psychology, the basic 
building block is the association , which was originally viewed aS a leerned 
connection between an observable stimulus- light, nonsense syllable, or mathemati- 
cal problem— and an observable OQn9e4t»nee or response— aolivat ion, another nonsense 
syllable, or solution. A connection or association is said to have been formed 
if the corresponding response appears with a positive probability whenever the 
stimulus is presented. Learning a concept^ presumably a more complex form of 
learning, involves the ability to give a common response to any one of a set or 
stimuli. To say that a subject has acquired the concept of red, fot example, 
implies that the subject is able to say some common response when shown any 
red object but will not give this response to any non-red stimulus. In short, 
whereas an association pai"S one stimulus with one response, a concept is a 
many- one relationship. 
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Since the aesocietion is felt to be basic, the S-R theorist has felt oblised 
CO represent the anyone concept relationships as a co^osite of one-one relation- 
ships. This has been made possible by the postulation of mediating links or 
associations. Thus, the many-one relationship may be represented. 
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uhere the stimuli S^.S^. and S3 are connected to the mediating response. 

«hose stimulus properties, in turn, elicit the observable response. R. In the 

.= U red M might be an internalized representation of the laccl 

case of the concept red, mgnc 

“red.” 

But the original task we were confronted with involved principles-and 
principles, from the standpoint of S-R associationism are more complex than either 
associations or concepts. It is probably because of this felt complexity that 
most psychologists, particularly experimental psychologists, have simply not been 
.uch concerned with principles. Since the notion did not readily fit into their 
scheme of things, they did what any thinlting man would do-l 80 o«« ‘I*® 

Nonetheless, an increasing number of pedagogically oriented psycho g 
have come to recognize the central role of the principle in meaningful learning, 
and at least two association-based representations of the principle have been 
proposed. Rather than delve into the possible limitations of these formulations. 

I shall propose an S-R formulation of my own-and then attacK it. 

First, consider the representation. 
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'triangle — color" 
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overt stimuli, and , are preeumed to elicit the mediator “triangle which, 
in turn, elicits the mediator “color." “Color," then, is presumed to elicit 
and R3— with equal probability. Of course, we know that this is not what happens 
from the study just described; S^goes with Rj^ , and with Ry 



To overcome this inadequacy, we may postulate a second pair of connecting 



ch&ins. 




✓* 




Black 



■White 




Accoidicg to this inteipr Jtatlor., elicits and not Rj because Sj^ elicits 

"black" as well as "triangle" and has no direct relationship to "white." Of 
course, S 3 elicits "white" .for the same (implied) reason. With this representa- 
tion in hand, the S-R associationist is now able to predict the results of the 
experiment described. "Black" is given as the response to the first test sti- 
mulus, a small black triangle, because of the preiearned association between 
the stimulus and "black" and the learned mediating association, "triangle 



elicits color." In short, two associative connections are better than one. 

Rather than go into the SFL representation of principle learning at this 
point, let me first add more fuel to the fire by presenting another example. 
Suffice it to say at this point that the essence of a principle is captured 
reasonably well by a statement of the form, "If A, then b 5" Consider the S-R 

pairs. 



If 






Suppose 
on the left, 
responses to 



( 7 , 1 , 6 , 4 ) >9 

( 4 , 8 , 9 , 3 ) »10 



( 9 , 7 , 8 , 6 ) > 

( 5 , 1 , 8 , 3 ) > 



( 6 , 5 , 8 , 9) — >5 

the learner is posed with the task of determining, from the instances 
that principle which will allow him to predict the appropriate 
the stimuli on the right. He migjit find that a difficult task. 
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overt Btlmuli, and S3 . are preeumed to elicit the mediator "triangle" which, 
in turn, ellcita the mediatcr "color," "Color," then, is preeumed to elicit 
and R3-with equal probabilinv. Of course, we know that this is not what happens 
from the study just describf,.J Sj,goes with R^ , and S3 with R-, 

To overcou.e this Inadequacy, we may postulate a second pair of connecUng 



ch&lns, 



According to 



•Black- 






iangle 






S3 



•I^lte- 



this interpr^tatior.y elicits R.^ 



- R-t 

Colo 

Rd 

rntl not R3 because elicits 




"black" as well as "triangle" and has r-> direct relationship to "white." Of 
course, S3 elicits "white" for the same (Implied) reason. With this represents 
tion in hand, the S-R assoclationist is now able to predict the resul|s of the 
experiment described. "Black" is given as the response to the first test sti- 
mulus, a small black tris:.gle, because of the • prelearned association between 
the stimulus and "black" and the learned mediating association, "triangle 
elicits color." In short, two associative connections are better than one. 

Rather than go into the SPL representation of principle learning at this 
point, let me first add more fuel to the fire by presenting another example. 
Suffice it to say at this point that the essence of a principle is captured 
reasonably well by a statement of the form. "If A. then B?' Consider the S-R 






( 7 , 1 , 6 , 4 ) >9 

( 4 , 8 , 9 , 3 ) »10 

( 6 , 5 , 8 , 9 ) — iS 



( 9 , 7 , 8 , 6 ) ^ 

( 5 , 1 , 8 , 3 ) > 



Suppose the learner is posed with the task of determining, from the Instances 
on the left, that principle which will allow him to predict the appropriate 
responses to the stimuli on the right. He might find that a difficult task. 
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But, if he l6 told that the responaea (Integera) can be derived uniquely from 
the integera in the firat, third, and fourth poaitiona of the atiouli, it 
might be eaaier. It would undoubtedly be atill eaaier, if he ic told that the 

operations of addition and subtraction are involved* 

More is involved in this example than in the first. The learner is required 
not only to discover the underlying principle, and a more complex principle at 
that, but hints have been given as to how he might accomplish the task. 

How might these contingencies be formulated in terms of associations? An 
immediate thought is that the position and operation cues might be viewed as 

mediating links, 0- $ . - {J). 

/ 

^;3 Positions— 2 Operations^ 

$2 0. R2 

Dut, as a quick look will make clear, that simply does not work. Further 
elaboration would he necessary to indicate why Sj goes with Ri , and Sg with Ej. 
Of course, there undoubtedly are other alternatives, but I think you will agres 
that any S-R ropresentatlo-.i of these contlnencies is likely to be extremely 

complex* 

It is laportant to emphasize that this situation was not picked arbitrarily 
simply to embarrass S-R psychologists. Stimulus dimensions (e.g., color, post - 
tion), which uniquely determine the responses and combining operations (e.g., 
...ft. the color, addition and subtraction), by which the responses are derived 
from these stimulus properties, appear to be crucial aspects of all principles. 

Fortunately, these characteristics play a central role in the SFL. In fact 
it is assumed that four elements, I, D, 0, R, are needed to specify a principle. 
The stimulus properties in the set, I, tell when the principle is to be used; 
those in D tell which properties determine the response; and the combining 
operation, 0, tells how the response properties, R, may be derived from those 
in D? Obviously, the attribute and operation cues. Involved in the above 
situation, find natural counterparts in Rhis characterization. Thus, D 
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involves the three dimensions corresponding to the three positions, and 0 is a 
co! 2 po 8 ite operation involving addition followed by subtraction. In addition, the 
response set R involves the integers variable (i.e,, set of integer responses), 
while those stimulus properties (ini.) which determine when the principle is to 
be applied are those associated with being a “four-tuple" of integers. Notice 
that the properties in 1 involve a first, third, and fourth posiition (tnose pro- 
perties associated with^, so that 1 necessarily involves JO, In the first of 
the (two) task X principles, i involved colored triangle, D color, ^ color 
naming, and R the color nas^s. 

Since principles, as well as assooiations and concepts, can be represented 
in the S-R language, it is appropriate to ask whether these notions can also be 
represented in the SFL, in which the principle is taken to be basjx. As can be 
seen in Table 1 this is indeed the case. Furthermore, unlike the S-R language, 
explicit distinctions are made between: (1) the observable S-R instances of a 

principle— the denotation; (2) the principle itself, that which underlies the 
behavior and whose presence can be inferred only indirectly; and (3) statements 
of the principle in symbolic form. 

The denotation of a principle is simply taken to be equivalent to the mathe- 
matical notion of the function, a notion which may be defined as a set of ordered 
S-R pairs such that to each;functionally distinct stimulus there is one corres- 
ponding functionally distinct response. The denotation of a concept is simply 
represented as a function in which there is one response common to all stimuli. 

To represent an ordinary association, the set is further restricted so as to 

include only one S-R pair. 

The principle itself is characterized as an ordered four-tuple (I, D^, 0^, R). 
where 1, Df and R are as previously defined. In the case of concepts and 
associations, there are certain restrictions placed on these .elements, but they 
need not concern us here. Finally, notice that the statements in the column on 
the right are constructed from symbols I> ^ ^ , and R , representing the 

constructs I, D, 0, and R. 
o 



Table 1 



SFL Representations of the Association, Concept, and Principle 



Principle 


1 

Denotation 

(Observables) 


Principle 

Underlying 

Performance 

(Inferred) 


Principle 

Statement 

(Obticrvable) 


» k 

i 

1(8^. T^\m} 1 

‘ 1 
t 

1 


a.2.o,R) 


If V, then 

S' 5 O' (£1) 


Concept 


(Note: One 


(I.S.O.S) 


If I', then 

S' E O' (£' ) 



response) 



Association 



(Note: 

One pair) 






If 1' , then 

£' = O' <£' ) 
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Due to space limitations, I will not go into figure 2 in any detail* let me 
simply point out that while there are common relationships between different 
pairs of the eacoerimental paired-associate (PA) list, none exists in the coa ^ t j y l 
list. The principles involved in the experimental ii%t might be stated, ”If 
black, then shape,” and "If white, then size,” "If a small black circle, then 
circle,” is a candidate for one .of the control list principles. 

Space limitations also demand that I not attempt to detail icy reasons for 
preferring the SFL to the S-R language, I would, hof»v«ir, like to mention three 
reasons why the SFL appe^rs to me to be better suited for formulating research 
on meanis»gful learning, in general, and mathematics And science learning, in 
particular. 

The first major reason is that principles are so critical, even in the 
simplest forms of meaningful learning. And, as we have seen, it is much simpler 
to represen t what is learned in the SFL than it is in terms of more cumberspnwL 
S-R representations. 

My second reason for preferring the SFL is that meaningful learning involves 
the ability to make the appropriate response in a class of responses to any one 
of a class of stimuli. Learning single principles may make such performance 
possible but not single associations. Lest there be some confusion, the stimuli 
and responses to which I am referring may be quite distinctive and not merely 
differ slightly along some physical dimension (l,e, , the stimuli and responses 
may be functionally distinct). In simple learning, where the stimulus dimensions 
are continuous, the ability to give a new response to a new stimulus had tradi- 
tionally been attributed to S-R generalization. Some of our recent data^ , how - 
ever, suggests that such a postulate may be inappropriate in theories of 
meaningful learning. Meaningful transfer may more easily and accurately be 
accounted for in terms of what principle is learned. 

The third and perhaps most important reason is simply that there are many 
situations in meaningful learning— we have reviewed just two— which, .whilp easily 
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represented in SFL terms, are difficult, if not impossible, to formulate in the 
S-R language <. 

^There are other situations vhich pose even more difficulty ror the S«R 
language, while yielding to SFL analyses. During the past year or so we have 
either completed or have SFL*based research under way on the following topics* 

(1) the role of attribute (D) and operation (0} cuing in learning mathematical 
principles, (2) rule generality in mathematics learning, (3) the role of syir 
holism in mathematics learning, and (4) the eacpository presentation of what 
mathematical strategy is learned in discovery learning. In the future, we hope 
to get into even more complcyi cognitive tasks such asi (1) identifying (and 
representing) heuristics of use in mathematical problem solving and proving 
mathematical theorems, (2) extending the SFL so as to provide a rigorous basis 
for ouch things as task analysis and Bloom* s taxonomy of educational objectives 
(cognitive domain), and (3) the construction of instructional sequences to meet 
multiple objectives. So far it appears that the SFL is capable of representing, 
if not all, then certainly a?any of the critical characteristics of such situs " 
tlons. 

In the final analysis, the choice between scientific languages (and theories) 
involves efficiency and cohesiveness as well as the sheer ability to represent or 
account for observable phenomena. It is in this sense that the much heralded 
adaptive quality of the S-R language too often has led psychologists to overlook 
the fact that it is always possible to patch up an existing formulation to meet 
new situations. Parsimony does not simply refer to the maintenance of existing 
concepts but to the formulation of emerging structures in the simplest possible 
way. I might add, in conclusion, that what is presently being done with the 
S-R language (e.g. , the incorporation of hlerarahies and motivation factors) is 
quite analogous to what pre-Copernican astronomers were doing when they invented 
epicycles to represent planetary motions in an attempt to salvage geocentric 



theory* 

Having made that emotion- laden comment, I rest my case. 



NOTES 



1. Thl8 article Is based on a paper read at the American Educational Research 
Association convention in Chicago on February 19, 1966, A description 

of some of the recent empirical research, based on the Set-Function 
Language, may be found in my "Precision in Research on Mathematics 
Learning," Journal of Research in Science Teaching , lV(December, 1966), 

253-274. 

2. F.J. McDonald, "Meaningful Learning and Retention; Task and Method 
Variables," Review of Educational Research, XXXIV<December, 1964), 542. 

3. J.G. Greeno and JJtS. Scandura, "All-or-None Transfer Based on Verbally 
Mediated Concepts," Journal of Mathematical P sychology, III, (July, 1966) 
388-411. 

4. Typically, a distinction is made between a mediating response. My, and 
its stimulus properties. Mg, so that the chain S -4My— vMg— )R would 
be used rather than S— >Mrs -tR* The shorter form suffices for present 

purposes. 

5. This representation was introduced by R.M. Gagne (e.g. , "The Conditions 
of Learning" New York; Holt, Rinehart & Winston, 1965). 

6. More recently, I have found it useful to make a distinction between 
rules and principles, a rule being completely characterized by D, 0, 
and R, The interested reader is referred to the references in footnotes 

1 and 7. 

7. In J. M. Scandura, E*. Woodward, and F. Lee, "Rule Generality in Mathematics 
Learnlngi" American Educational Research Journal , in press, no S-R 
generalization gradient was found in learning mathematical rules. 

Such a result runs contrary to what would be eacpected ;by Most S-R 
theorists on the basis of experiments on simple learning. 
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"CONCEPT” LEARNING IN ^^lATHOIATICS* 

(Research in the Emerging Discipline of Psycho-Mathematics) 

Joseph H. Scandura 



Doing basic research on mathematics learning is a risky business. 
In the first place, it is hard to know whether one is asking the right 
questions and, in the second place, it has been difficult to formulate 
significant questions involving mathematics learning in a researchable 
form. One of the major reasons for this state of affairs has been the 
lack of any suitable theoretical superstructure from which to work. 

The purpose of this paper is to outline some of the theoretical work 
underway at the University of Pennsylvania and to show how this 
theoretical work has helped to improve our understanding of how 
mathematics is learned. 



*This paper is based on invited talks given during the Spring of 1967 
at the University of Delav/are and the State Supervisors of Mathematics 
Meeting and the NCTM convention in Las Vegas. The author would like 
to thank Joan Bracket for her assistance in the preparation of this 
article. Her participation was made possible by a graduate research 
training fellowship supported by the U.S. Office of Education. 



background 



Theoretical development la educational psychology has been extremely 
slow. One major reason has been the typically imprecise definition of 
independent and dependent variables in research on meaningful learning and 
teaching. Such research can bear only an ambiguous relationship to theory* 
Similarities and essential differences often go undetected* Stating 
research objectives and defining variables in unambiguous terms, however, 
is not sufficient* The teaching-learning process has all too frequently 
been studied in terms of such "administrative’' variables as class size, 
grade level, X*Q*, and amount of teaching experience* To be theoretically 
relevant, the variables chosen must have broad explanatory potential* 

They shoula not merely be symtomatlc of and Inextricably related to the 
question at hand* Theory development depends on much more than mere fact 
finding . 

To provide a substantive base for their research, educational psychol- 
ogists have frequently resorted to the languages, paradigms, and theories 
of the mother science of psychology* Mediatlonal elaborations and operant 
conditioning paradigms of the S-R language and more general, but less well 
specified, cognitive theories have been popular* 

Each approach has important limitations*^ From one point of view, 

^Joseph M* Scandura, "The Basic Unit in Meaningful Learning— Association 
or Principle?", The School Review * (1967, in press)* 

Joseph M* Scandura, "Precision in Research on Mathematics Learning (The 
Emerging Field of Psycho-Mathematics)*” Journal of Research i n 
Science Teachin g* (1967)* 

Joseph M* Scandura, The Mathematical Formulation of Educational Research 
Questions . (Washington; U*S* Office of Education (Project 6-8002), 
1967)* 



parsliuony suggests that the properties of overt S-R associations should 
also be attributed to mediational links. Yet, practice has shown that 
mediational interpretations become increasingly cumbersome and less precise 
as situations become mere complex* Similar difficulties have plagued 
researchers who have used operant techniques to study meaningful verbal 
learning. The results simply are nox^here near as clear in complex human 
learning as they are in the ’’Skinner Box*” It is increasingly recognized, 
for example, that knowledge of results is not directly analogous to feeding 
a pigeon and that, in a?ry case, other factors, such as subject matter 
structure, are probably of greater importance in promoting efficient 
learning* A general limitation of cognitive theories is their relative 
imprecision* Typically, ’’cognitions” are either not clearly specified in 
observable terms or are only partially defined. Under these conditions it 
has been impossible to construct a predictive theory— -the sort of theory 
needed if practical implications are to be obtained. 

In short, the choice to date has been between a precise, but seemingly 
inappropriate S-R language, and presumably more relevant cognitive 
formulations which leave much to be deslrea Insofar as scientific cohesive- 
ness and rigor are concerned. 

In an attempt to overcome these difficulties, I have proposed a new 
scientific language for formulating research questions on meaningful 
learning* Because the language is framed in terms of the mathematical 
notions of sets and functions, the name Set-Function Language (SFL) was 
adopted*^ 

I won't go into detail here, but before outlining the SFL it may be 
instructive to briefly consider the S-R mediation language. In S-R 
psychology, the basic building block is the association * a construct which 

^See footnote 1* The descriptor ’’Set-Function” should not be confused with 
set functions* 
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was abstracted directly from observeci cotinectlosis between overt stimuli 
and overt responses* 

Loarr:ln3 a concept, presuiaably a more complex form of learning, 
involves the ab ility to give a common response to any one of a set of 
stimuli. To say that a subject has acquired the concept of ’’red,” for 
exarpLc:, implies that he is able to give some common response, when shown 
any red stimulus object, but will know not to give this response to any 
non*'i:ud stimulus* Similarly, a child may be said to have acquired the 
concept of "four” if he can say "four" when presented with any conglom- 
eration of four objects but xcill not say four to any conglomeration not 
containing four objects— i.e. , assuming, of course, th^Jt the child is 
operating under the same set of Instructions in each case* In short, 
whereas an association pairs one stimulus with one response, a concept is 
a many-to-one relationship.^ 

S-R theoriots have felt obliged to represent the many-to-one concept 
relationship as a composite of one-to-one associations. 
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Notice that the stimuli Sj^, $2, and are connected to the mediating 
response whose stimulus properties, in turn, elicit the observable 
response R* 



The Basic U?:lt in Meaningful Learning 
Most subject matter learning involves neither associations nor concepts 
but, as they have been variously called by ulfferent investigators, rules, 



should like tc point out that in talking about single stimuli (responses) , 
I am actually referring to equivalence classes of overt stimuli (responses) 
in which the membnrs (i»e., the stimuli) of the classes are either 
indi3tingrishab?v.e or ot.herwipe play exactly the same role. For example, 
the stimuli ** 5 ”, ”flve'% "Five” , are all equivalent in so far as the 
number five (as opposed to the numeical ’’five”) is concerned. 



principles, schemas, heuristics, and TOTE units* This is true even more 
so of mathematics learning* To be more specific, meaningful learning 
implies the ability to give the appropriate response in a class of 
responses to any stimulus in a class of stimuli* Unfortunately, this fact 
has often been overlooked because the term "concept** has been used so 
wlciply in discussing subject matters* When we say that a child "has the 
cor .ee> »t of addition," for example, what we probably mean la that he can 
give the appropriate sum when presented with any pair of numbers* Put 
another way, the learning involved connects a large class of stimuli with 
a large class of responses* 3y definition, a concept connects a class of 
stimuli to exactly one response* 

To see what is involved in meaningful learning, learn the following 
S-R pairs (l*e*, overt Inputs and outputs): (4 3 1) •>3, (C 1 6) 2, 

(7 9 2) -► 5, and (9 5 1) -► 8* Now, on the basis of what you have just 
learned, give what you think should be the responses to the stimuli (7 2 1) 
and (4 7 2). Did you give the responses 6 and 2? so, you probably 
acquired a unit of knowledge (l*e*, rule) which might be stated. Subtract 
the number in the third position from that in the first." If you did not 
give these responses, you presumably learned the pairs on the left as 
discrete entities — i*e*, as distinct associations without noticing any 
relationships between them* (Of course, I diun’t indicate that there was 
any such relationship so you may not have been looking for one*) In this 
case, the rule governed responses, 6 and 2, would be expected on the basis 
of chance alone* 

I should like to emphasize that this situation was not picked 
arbitrarily simply to embarass S-R psychologists. I?hereas a "patch job" 
can be done with certain special cases, I believe that I am safe in saying 
that to date no satisfactory way of representing rules ^ ^ solely in terms 



of associations has been found. *♦ Stimulus dimensions which uniquely 
determine the responses (e.g., the first and third positions) nnd the 



combining operation or transformation by which the responses are derived 
from the determining stimulus properties (e.g.* subtraction) appear to be 
crucial aspects of all rules. While stimulus properties and derivea 
responses play a central role in S-R mediation theory, ^ there is no 
counterpart for the transform or combining operation. 

The Set^Function Language (SFLj_ 

Fortunately, all of these characteristics play a central role in the 
SFL. In fact, have proposed that four characteristics are needed to 
specify a principle . Three of these characteristics specify a rule and a 
fourth determines when the rule is to be applied. Those stimulus 
properties which determine (D) the corresponding responses constitute 
one such characteristic, the covert responses or derived stimulus properties 
(R) are another, and the transform or combining operation (0) by which 
these covert responses are derived from the determining properties is the 
third. The fourth consists of those, usually higher order, stimulus 
properties which identify (I) the rule to be applied. For example, the 
rule, for summing number series, where N is the number of terms, applies 
only to those series which consist of the consecutive odd integers 

beginning with one (e.g., 1+ 3 + 5 + 7 *4^ « 16). 

Since principles, as well as associations and concepts, can be 
represented in the S-R mediation language, it is appropriate to ask whether 
these notions can also be represented la the SFL, in which rules and 
principles are taken to be basic. This is indeed the case. Furthermore, 

^See footnote 1. 

^hey correspond, in fact, to mediating responses and their stimulus 
properties, respectively. 

^Sce footnote 1. 



unlike the S-R language, explicit distinctions are made between: (1) the 

observable S-R instances of a rule— the denotation, (2) the rule or 
principle itself, that which underlies the behavior and whose presence can 
be inferred only indirectly, and (3) statements of the rule or principle 
in symbolic form. 

The denotation is simply a function « a (mathematical) notion which may 
be defined as a set of ordered stimulus-response pairs such that to each 
stimulus there is one corresponding response. The denotation of a concept 
is simply represented as a constant function in which there is one response 
coirmon to all stimuli. To represent an ordinary association, the set is 
further restricted so as to Include only one S-R pair. 

The rule construct is characterized as an ordered triple (D, 0, R), 
where D, 0, and R are as defined above. Principles, of course » are ordered 
four-tuples (I, D, 0, R). In the case of concepts and associations, there 
are certain relationships among these characteristics,^ but they need not 
concern us here. In stating rules or principles, I have used primes to 
distinguish the signs used to represent the constructs I, D, 0, and R 
from the constructs themselves. 

The one point I would like to emphasize is that even S-R theorists 
have been forced to adopt the idea of a transformation or combining 

O 

operation in order to represent meaningful learning and thinking. Berlyne 
has done this, for example, in his fine book Structure and Direction In 
Thinking . In his latest article on the topic of principle learning, Gagne^ 

^For more details see Joseph M. Scandura, The Mathematical Form ulation of 
Educational Research Questions. (Washington: U. S. Office of Education 
(Project 6-8002), 1967). 

®David E. Berlyne, Structure and Direction In Thinking (New York: Wiley, 

1965) . 

^Robert •!. Gagne, "The Learning of Principles,” in H. J. Klausmeier and 
C. W. Harris Analysis of Concept Learning (New York: Academic Press, 

1966) , pp. 81-96. 



has adopted this point of view as well* Even many long-tiiae mediation 
enthusiasts have recently become convinced that mediation theory is 
inappropriate for dealing with verbal learning*^® 

In so doing, these theorists are in fact aenyin& the primacy of the 
association as a construct* The idea of a transformation or combining 
operation takes over this primary role* I might note parenthetically that 
what here have been called response determining properties of stimuli, S-k 
theorists have called mediating responses. Similarly, our covert responses 
(or, derived stimulus properties) correspond to stimulus properties of 
mediating responses * yiiat the combining oncrat ion does is t o make explicit 
how the mediating stimuli are determined from the antecedent_nediati^ 
responses * VJhile this process may not be important in many forms of simple 
learning, we have just seen here how it becomes crucial in rule learning* 

The transformat5.on, mapping, combining operation, or whatever term is used, 
becomes of central concern, and to still call any suitable representation 
associatlonlstic or even neo-associationistic is stretching the term beyond 
its reasonable limits* 

In the final analysis, of course, the choice between scientific 
languages (and theories) involves efficiency and cohesiveness as vjell as the 
sheer ability to represent or account for observable phenomena* It Is in 
this sense, that the much heralded adaptive quality of the S-R language too 
often has led psychologists to overlook the fact that it is always possible 
to ”patch“up" an existing fomulation to meet new situations* Parsimony 
does not simply refer to the maintenance of existing concepts but to the 
formulation of emerging structures in the simplest possible way* I might 
add that what is presently being done with the S-R language (e*g*, the 
inclusion of associative structures, reference mechanisms, etc*) is quite 



^^Personal communication* 



analogous to what pre-Copernlcan astronomers were doing when they invented 
epicycles to represent planetary motions in an attempt to salvage geo-centric 

theory . 

EMPIRICAL RESEARCd BASED OM THE SFL 

Let me turn now to a consideration of two problems that have long 
plagued matbematics educators, the problems of rule-j^e nerallty and ^^coyery^ 
learning . In the space remaining, I shall attempt at least partial 
resolutions of these two probj.etDs and, in the process, will indicate how 
the SPL helped to achi'^ve these resolutions by wahing it possible to 
formulate the underlying questions in a precise way. Ernest Woodward and 
Frank Lee assisted me with tha Rule-Generality Study and the Discovery 
Learning Study was conducted with now Dr. William Roughead. 

R ule Ge.nGralitv^ ^ 

In instructional situations, the question often arises as to how 
general the presentation of material ought to be. Subject matter 
specialists and most educators tend to emphasize that the more general the 
presentation, the more useful it will be. Learning oriented experimental 
psychologists, on the other hand, are often inclined to point out that the 
more speci^iic the precentation, the better the learning. 

The qao.sticn of generality arises particularly often in mathematics 
instruction. Should addition and subtraction be taught as two distinct, 
although related, operat?.ons, as has been done traditionally, or as one 
operation as is done in more modern treatments? Should the three cases of 
percentage be taught separately or as variants of the rule, “base X rate * 
percentage?" Should pupils be taught the method of "casting out nines" or 

^^Th:’c section is based on Joseph M. Scandura, Ernest Woodward, and Frank 
Lee, "Rule Generality and Consistency In Mathematics Learning," American 
Educational Research Journal. (1967, in press). 



be taught the note general principles of modular arithmetic? How 
generally should theorems be stated? Proofs? The answer to these and 
related questions hinges, iu part, on the leamabillty and utility or scope 
of the rules involved. 

I^thematics educators have concerned themselves with such questions, 
but they have bad to make judgments on largely intuitive grounds. There 
is a real tc better understand the psychological principles involved. 

Unfortunately, hc>7ever, previous studies involving rule learning have 
dealt only incideatally v/lth this problem. Perhaps mere important, even 
the best designed studies in this area arc subject to criticism for 
failure to distinguish betweexi structure aid behavior variables. The 
variables chosta# (e.g., rule and example given, discovery, answer given) 
are often merely symptomatic of, rather than basic to, what is involved. 

The fundamental assumption underlying our approach to the problem 
was that more rapid progress can be made by distinguishing clearly between 
structure and behavior variables and by identifying the important parameters 
of each. I would go even farther and say that we can never hope to 
understand mat htiwa tics and other subject matter learning without making 
such a distinction. 

At the time the gc:nerality study was designed, the SFL had only 
developed to the peint where rolea were defined in terms of their 
denotative sets of orde:tfed stimulus-response pairs. No consideration was 
given t:o the nature of the underlying rule construct. Even so, the fact 
that sets can be ordered as to their inclusiveness led naturally to the 
question of rule generality, lict only did this question have practical 
relevance but, more important from the standpoint of theory j the SFL 
provided a basis for rigorously defining just what is meant by generality. 
One rile is saiu to be more general than another if the denotative set of 
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the former includes that of the latter as a archer subset (i.e., the 
former set Includes all of the instances of the latter plus some of its 

own) • 

Our primary motivation for the rule generality study, then, was to 
”try out” this definition, to see if it had the sort of straight forward 
behavioral implications we had hoped. In particular, notice that all S-R 
insi ces of a principle are treated equally. ^ stimulus within the 
scope of a rule snould provide an adequate test of its acquisition. 

Similarly, performance on extra-scope test stimuli should be uniformly 
nil. Once learned, a highly general rule would, of course, be expected to 
induce appropriate performance on a wide variety of tasks. At the same 
time, however, it is quite possible that the ease of learning a rule 
statement, as judged by the ability to use it, may vary directly with its 

specificity. 

Another facet of our rule generality research concerned the consistency 
with which a learned rule is applied. In my earlier research, it was 
found that, under certain conditions, experimental subjects respond 
consistently in accordance with a derived rule. When told that their first 
response was correct, those subjects who used a rule as the basis for 
responding to a first test item also used the rule on a second test item. 
These pilot results were obtained in a discovery-learning situation with 
simple materials. There was a need to extend this finding to more complex 
subject matters which are presented by exposition. In general, we found 
strong support for this contention. Apparently, people tend to respond in 
a consistent manner unless the context is changed or feedback otherwise 
indicates that the rules have changed. 

e.g.. See Joseph M. Scandura, "Teaching-Technology or Theory,” i^ric^ 
Educational Research Journal ^ 1966, 3 , 139-146. 
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For the purposes of this Uiscusslou* we shall be primarily concerned 
with only two hypotheses. First, performance on problems within the scope 
of a rule does not differ appreciably and su*:cessful problem solving is 
limited almost exclusively to within-scope problems. Second, the ease 
of learning a rule statement so that it can be applied to within* scope 
problems varies directly with the rulers specificity. Thus, the more 
general a rule statement, the poorer the learning. 

To test these hypotheses we conducted two experiments, only one of 
which I shall outline here. The crux of the experimental design and the 
results can be seen in Table I. The three groups of experimental subjects 
(Ss) were undergraduates enrolled in a mathematics education course for 
elementary teachers. Each group was presented with one of three rule 
statements of varying generality. All of the ^s were then tested on the 
same three problems. Rule S was the most specific and was appropriate for 
solving any problem in the certain class— problem one was one such problem. 
Rule SG was more general and was potentially applicable to a wider range 
of problems. In particular, it was logically sufficient to solve both 
problems one and two, but not problem three. Rule G was the most general 
rulej it was applicable to all three test problems. 

T/^LE I. 

HUMBER OF CORRECT SOLUTIONS 
N Problem One Problem Two Probl emJThr^e 



Group S 


17 


13 


0 


1 


Group SG 


17 


5 


4 


0 


Group G 


17 


5 


5 


4 



In agreement with the first hypothesis, there was almost no extra- 
scope transfer and performanca on \;ithin-scope problems was essentially 

» 
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the same. Notice, in particular, that the performance obtained could be 
predicted on the basis of a strictly logical argument. If people learn 
an unfamiliar rule ^ ^ then they ahoulo be able to apply it to any 
problem or stimulus within the scope of the rule but not to similar problems 

beyond the scope of the rule.^^ 

Defining rule generality in temus of the scope of the respective 
denotative sets of S-R pairs, however, does not provide a sufficient basis 
for explaining the results pertaining to the second hypothesis which was 
concerned with the learnabllity of rule statements of different generality. 



In the beginning, this hypothesis was based simply on intuition. 

To check this hypothesis, consider the performance of the three groups 
on problem one. Since this was the only problem within tae scope of all 
three rules, we expect on the basis of our hypothesis that group S would 
do better than group SG x^hich, in turn, would do better than group G. As 
you can see, group S did do much better than the otners, but the 
performance of the subjects in groups SG and G did not differ appreciably. 

To explain these results, it was found necessary to define an under- 
lying construct, a cognitive competency underlying the behavior we observed. 
This led me to the four-tuple characterization previously outlined. When 
the competencies necessary for interpreting the rule statements of varying 
generality were analyzed it became apparent that the more general the rule, 
the more is required of the learner. Thus, to apply a highly general rule 
statement, once memorized, requires that the learner be able to apply any 
rule of lesser generality but not conversely. To subtract any two numbers, 



for example, implies that 2 can be subtracted from 6, but being able to 
subtract 2 from 6 does not imply that the learner can subtract with any 



^^Actually, the facts are not quite this simple, but the statement appears 
to be a good first approximation. John Durnln and I have a study unoerway 
in the Penn Laboratory which we hope will add further clarification. 
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two numbers. 

The fact that performance of the groups given the most general (G) 
and intenuediate (3G) rule statements did not differ can be attributed to 
prior learning. In effect, those additional abilities needed to Interpret 
and, hence, to apply rule G were not likely to hr».ve caused the college 
students involved any difficulty. 

Cince postdiction is hell in generally low regard in the psychological 
(but not scientific) co.nmunlty, let ne add a brief word of defense. In 
this particular study, we beg'Sia with a precise dafiuition of rule generality 
based on a preliminary version of the SFL; we conducted a study, and, 
finally, we used the results to extend and othenrise Improve the very 
foundations on which our a priori analysis had been based. 

What educational Implications can be drawn from this study? To begin 
with, the results of these experiments demonstrate, in a rather conclusive 
fashion, the behavioral relevance of rule generality. For the most part, 
successful performance was noted only on tasks within the scope of 
verbally stated rules. When rules are presented in an expository fashion, 
it is normally toe much to expect generalization to problems to which the 
principle does net Immediately apply. 

Potentially of even greater practical significance was the lack 
(there was one exception) of performance differences on the within-scope 
problems and the consistency results cited earlier. The former result 
demonstrates that, under certain specifiable conditions, any stimulus 
within the scope of a rule is equally as difficult to respond to correctly 
as any other. Furthermore, the obtained consistency results suggest that 
only oM (new) test stlmulrs is needed to determine whether, in fact, a 

^^See footnote 13. I might add that some transfer does take place and we 
are now attempting to pin down the source of this transfer. 



given rule has been learned* Under certain specifiable conditions > no 
core information is gained by using additional test instances* These 
refU-'lts could have far-reaching Implicatioas for the development of hlglily 
eff i cient measuring Instruments * 

In addition, the pronounced tendency of the ^s to attack all of the 
test piob]).c}Q3 in the same way, irrespective of whether the procedure used 
was approprlatCj suggests that knowing how to solve problems and knowing 
when to use this knowledge are quite distinct* Testing for the latter 
ability necessarily mnct involve the presentation of extra-scope problems* 

Host important, ir. view of the problem I posed originally, this study 
helps to reconcile the views of subject matter and learning specialists 
as to how general the presentation of material ought to be* While one may 
expect maxiiaum transfer potential by introducing rules of the greatest 
possible generality, th is transfer potential Is bought at a price the 
teacher may or mrv not be wlMing to nay * The greater its generality, the 
harder a rule statement is to Interpret* Hence, before determining how 
general a rule statement , should be,. It is essential to first consider 
whether the. students involved have the necessary requisite interpretative 
abilities . * ^ 

Actually many teachers already do this, at least intuitively* All 
that the present study does is to make these intuitions more explicit* 
Frankly, I am always pleased when our results conform to what might be 
called "eoramon sense*" Too often psychological results, while perhaps 
relevant to animal learning or the memorization of nonsense-syllable lists ; 
have very little to say about the learning of mathematics and other subject 
matters* 

^ ^Joseph M* Scandura, "On Ability Treatment Interactions (The Learning and 
Intcrpretability of Synbolically and Verbally Stated Mathematical Rules)," 
Jou=:? ial of Educational Psychology (1967, in press)* 



Discovery Learning ^ ^ 

One of the fundamental asoumptlons underlying several of the new 
mathematics programs is that discovery methods of teacning and learning 
incre^.se the student *s ability to learn new mathematics* Indeed* this 
assumption has guided the development of many new curricula in all of the 
suVjact matter fields. Attempts to demonstrate advantages or disadvantages 
of self-discovery, however, have either failed, been dpen to criticism on 
scientific grounds, or are seemingly inconsistent even when apparently 
well-controlled. 

Research on discovery learning has been confounded by differences in 
terminology, the frequent use of multiple dependent measures, and vagueness 
as to what is being taught and discovered. Wliile the difficulties due to 
the use cf inconsistent terminology can often be minimised by a careful 
reading of rosearch reports, the use of multiple dependent measures often 
maker it Ipipossibte to unambiguously interpret experimental results. 

Several Investigators, for example, have found that groups which are given 
an expository statement of a rule perform better on transfer tests than 
groups which are required to discover this rule for themselves from Instances 
of the rule. The obtained differences in transfer ability, however, may 
well hav« been because the discovery groups simply did not discover the 
rule. 

Gagne and Brown^^ overcame the dependent measure problem by equating 

original learning and investigating only transfer differences on new 

^ ^William 6. Roughead and Joseph Scandura, "Sequencing and the 
Expository Presentation of What is Learned in Mathematical Discovery, 

(paper read at the A.E.R.Ao convention in New York on February 18, 1967). 

l^Robert II. Gagne and L, T. Brown, "Some Factors in the Programming of 
Conceptual Learning," Journal of Experimental Psychology (Vol. 62, 1961), 
pp* 313-321* 



problems. On the basis of an analysis of the learning programs used in 
the Gagne and Brown study » Eldredge hypothesized that the obtained results 
could have been due to a number of flaws in the programs used» fildredge 
proposed that ejcpositi.on and discovery situations may be better 
character.^ zed as differences in order of presentation. Exposition may be 
defined as giving rules and then examples of these rules j whereas discovery 
may be defined as giving the examples and then the rules. Contrary to his 
hypothesis, however, his discovery group did evidence more transfer than 
his exposition group. Unfortunately, there were a number of difficulties 
with the study that make the results difficult to interpret. 

The Set-Function Language was used as an aid in removing these 
difficulties. The resulting analysis of what is involved in discovering 
rules indicates that discovery learners learn "something" by which they 
can derive solutions to an entire class of problems. Roughead and I 
called this "something" a derivation rule . Thus, discovery learners who 
actually succeed in making a discovery, should be expected to perform 
better than expository learners on tasks which are within the scope of 
such a derivation rule. If the nev’ problems presented have solutions 
beyond the scope of a discovered derivation rule) however, there would 
be no reason to expect discovery £s to have any special advantage. 

This study was concerned with two basic questions. First, can "what 
is learned" by discovery be identified and if so, can that knowledge be 
taught by exposition with equivalent results? According to the SFL-. all 
behavior is controlled by rules so that there might well be some identifiable 
rule which is equivalent to "what is learned” by discovery. Specifically, 

^®Gart:h M. Elured?re, "Discovery vs. Expository Sequencing in a Programmed 



Unit on Suiaming ' 
Eldredge, and BIj 
and Transf er of j 
1965 ) . 




we hypothesized that "what is learned” by guided discovery In the Gagne and 
Brown study could be identified and, hence, could be presented by 
exposition. The second question was, how is "what is learned” by discovery 
dependent on what the learner already knows and/or the nature of the 
discovery treatment itself? More particularly, we hypothesized that the 
discovery of a derivation rul m actually be hindered by having too 
much prior information. 

Assuming transfer depends only on whether or not the deriva^ ion rule 
is learned, sequence of presentation should have no effect on transfer so 
long as the subject is forced to learn the underlying derivation rule. 

That Is, presenting the derivation rule by exposition or by guided 
discovery either before or after presenting the desired responses should 
have no effect on performance on transfer tasks. On the other hand. If a 
discovery program simply provides an opportunity to discover (with hints 
as to the solution) but does not guide the learner through the derivation 
procedure, sequence of presentation might well have a large effect on 
transfer. Assuming the learner Is capable and motivated, he may well 
succeed In determining the appropriate responses and. In the process, 
discover a derivation rule. It is not likely, however, that a person 
would learn such a derivation rule If he already knew the correct 
responses. 

We made three hypotheses: 1) what is learned by guided discovery 

can be presented by exposition with equivalent results; 2) presentation 
order is not critical when learners are effectively "forced” to learn 
derivation rules, either by exposition or by guideu discovery* 3) presen- 
tation order is critical when the discovery guidance provided is specific 
to the respective responses sought rather than relevant to a general 
strategy or derivation rule. 
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The task we used was essentially Identical to that used by Gagne and 
Brown, and Eldredge and Involved finding formulas for summing the terms 
in number sequences. That is. the stimuli were number series, like 1 + 

3 + 5 + ang the responses were formulas in n, the number of terms, for 
summing such series. For example, the appropriate formula for summing 
1 + 3 + 5 + 7 + ... + 2n-l is n^. 

Using the SFL as a guide (l.e., by identifying, in turn, U, 0, and R) 
we were able to identify that derivation rule taught in the guioed 
discovery program used by Gagne and Brown. On the basis of this knovrledge, 
four programs were constructed; 1) the formula-siven program simply 
stated the correct summing formula for each problem series confronted in 
the l»aro1 ns program; 2) the euided discovery, program remained essentially 
as it was in the earlier studies; 3) the expositor program consisted of a 
precise expository description of that derivation rule which was presumably 
equivalent to that learned by guided ciscovery. It consisted of a general 
procedure by which the desired formulas could be derived; 4) in the 
„^P»,r..n<t-„-to-dlscover program, the problem sequences were presented along 
with encouragement and hints as to wh a^ the desired formulas were These 
hints Involved such statements as "the formula has a '2' in it." The same 
number sequences were used in each of these four programs. 

Seven treatments were constructed by combining these four basic 
programs. After going through a common introductory program, one group 
of subjects simply went through the formula program. The other six groups 
received both the formula program together with one of the other three 
programs. Two of these six groups received the guided discovery program 
together with the formula program; two additional groups received the 
expository and formula programs; and the final two groups received the 
opportunity-to-dlscover and formula programs. One group, in each of the 





resulting three pairs, received the programs in one order; the other group 
received them in the reverse order. Only the order of presentation was 
varied. After finishing their respective programs, all of the students were 
tested on new series to see how well they could determine the appropriate 

sunming formulas. 

The results were rather clear cut. Essentially, the Rroup__pJyen_^e 
formula program only and the group given the formul a programJollo^OX 
the opportunity to discover program performed at one level. The other fiv e 
groups performeci at a common and significa ntly higher level. Two points 
need to be emphasized. First, "what is learned” during guided discovery 
learning can at least sometimes be taught by exposition— with equivalent 
results. Of course, there are undoubtedly a large number of situations 
where because of the complexity of the situation, "what is learned = during 
discovery can not be clearly identified. It is still an open question, for 
example, whether still higher order derivation rules, vjhich have a more 
general effect on the ability to . learn, may' be learned by discovery. If we 
believe that the ansx^er to this question is affirmative, there is no real 
alternative to learning by discovery unless or until we can identify just 
what is involved. Nonetheless, intuition-based claims that learning by 
self-discovery produces superior ability to solve new problems, as opposed 
to learning by exposition, has not withstood experimental test. The value 
of some forms of discovery to transfer ability does not appear to exceed 
the value of some forms of exposition. Apparently, the discovery myth has 
come into being not so much because teaching by exposition is a poor 
technique as such but because what has typically been taught by exposition 
leaves much to be desired. As we identify Just what it is that is learned 
by discovery in more and more situations, we shall be in an increasingly 
better position to impart that same knowledge by exposition. 
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The secona point to be enphasited concerns the sequence effect. While 
the group that was given an opportunity to discover and then the formula 
program performed as well on the transfer problems as those given the 
derivation principle in a more direct fashion, the group given these 
programs in the reverse order (l.e. , the formula-opportunity group) did no 
better than those f'l given the formula program alone. In effect, if a 
person already knows the desired responses, then he is likely not to discover 



a more general derivation rule* 

An extrapolation of this result suggests that if S knows a specific 
rule, then he may not learn one which is more general even if he has all 
of the prerequisites and is given the opportunity to do so. The reverse 
order of presentation may enhance discovery without making it more difficult 



to learn more specific rules at a later time. In effect, prior knowleuge 
may actually interfere in a very substantial way with later opportunities 

to discover. 

This sequencing result may have important practical and theoretical 
Implications. The practical implications tzill be attested to by any junior 
high school mathematics teacher who has attempted to teach the ’’meaning 
underlying the Various computational algorithms after the children have 
already learned to compute. The children, in effect, must say to themselves 
something like, "I already know how to get the answer. VIhy should 1 care 
why the procedure works?” Similarly, drilling students in their 
multiplication facts before they know what it means to multiply, may 



interfere with their later learning what multiplication is. Let me make 

spite of this fact there way be some advantages inherent in learning 
more specific rules. Mthough the data are not entirely clear 
ooint. it is quite possible th.at specific rules may make it possible to 
deteruW responses more quickly than rules which are more general. 
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this point clear • because it is an. important one* I am not saying that 
we should teach meaning first simply out of some sort of dislike for rote 
leaming*~for certain purposes rote learning may be quite adequate and the 
most efficient procedure to follow* What I ^ saying is that learning such 
things as how to multiply without knowing what multiplication means • may 
actually make it more difficult to learn the underlying meaning later on* 
The theoretical implications are even more Interesting for the researcher 
and, in fact, laay be crucial to any theory based on the rule construct and 
framed in the SFL«*~but space limitations demand that X not go into that 
here* 

In addition to the studies described above, we have conducted a number 
of other studies, which are based on the SFL, but space limitations make 
it impossible to more than mention them here. One of these studies is 
designed to help clarify the role of attribute and operation cueing in 
learning mathematical rules* Another deals with the role symbolism plays 
in mathematics learning* We are also involved iu developing a completely 
new methods course in mathematics for elementary school teachers which is 
intimately tied to this point of view* 

Concluding; Remarks 

In this paper, I have tried to share with you some of my thoughts on 
the psychology of matheioatlcs learning— or what I like to think of as the- 
emerg ing discipline of pG vchO”inatheiaatics * As is obvious by now, I chose 
not to do this in a direct manner but by pointing out certain inadequacies 
in existing behavioristic theory as it relates to mathematics learning and, 
more important, by describing an alternative sciHntific language and 
showii'.g how it caii be used to formulate research questions involving 
mathematical learning and performance* 



The mathematical notions of sets and functions were proposed as a 
basis for representing; (1) the denotative or observable aspect of rules 
and principles, (2) the underlying knowledge itself, and (3) meta-linguistic 
descriptions or representations of the underlying knowledge. Very recently, 

I have become intrigued with the idea of integrating these ideas by 
borrowing the very funaamental but more abstruse mathematical idea of a 
func tor. The functor may also make it possible to distinguish in a very 
precise way between the sort of **ideal** competencies which have long been 
championed by linguists and competencies as they actually exist in human 
beings. In the present version of the SFL, it has only been possible to 

deal with idealized rules (l,e,, competencies). 

To insure continued progress, it seems to me that a dual emphasis is 
needed in nsvcho-mathematlcs , On the one hand, there are a large number of 
unspecified, but crucial, "ideal” competencies which underlie mathematical 
behavior. These need to be identified. This is a problem area which in 
many ways is analogous to linguistics and whose solution will require a 
substantial knovledge of mathematics coupled with a behavioral point of 
view. There is also the. urgent need to consider how the inherent capacities 
of learnerc and their pre.viov, 3 ly acquired knowledge interact with new input 
to produce mathe-^.atical learning and performance. Again by analogy, we 
have a fie5d much like psyclio -linguistics, a field which will seek to 
integrate knowledge concerning psychology and mathematJxal structures and 
Strategies, I lael that this kind of distinction will prove crucial to 
any de«p understanding of hew mathematics (and other subject matters) 
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Background 



In spite of an increasing number of basic research studies, 
theoretical development in educational psychology has been ex- 
tremely slow. With few exceptions, the research on meaningful 
learning has been fragmentary. Results have often appeared to be 
inconsistent; similarities and essential differences have gone 
undetected. 

A major reason for this slow progress has been the lack of 
an appropriate scientific language with which to discuss educational 
research problems. Existing languages in psychology fail to 
capture the essence of meaningful learning and teaching. The 
choice to date has been between a precise, but seemingly inap- 
propriate S-R language, and presumably more relevant cognitive 
formulations which leave much to be desired insofar as scientific 
enhesiveness and rigor are concerned. 

At the time this project was initiated, preliminary steps 
had been taken toward the development of a new scientific language 
for formulating research questions on meaningful learning. 

Because the language was framed in terms of the mathematical 
notions of sets and functions, the name Set-Function Language 
(SFL) was adopted. 

Objectives 

The overall objective of this research was to extend, 
elaborate, and otherwise improve the SFL. We sought to determine 
its strengths and weaknesses by applying it to a variety of 
complex learning situations, particularly those involving mathe- 
matics. 

More specifically, we proposed to ; 
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(1) contrast the SFL with the S-R language, 

(2) presenta laethodology for assessing what is learned, 

(3) (re) formulate a variety of research questions in terms 
of the SFL and the related assessment methodology, and 

(4) deLJcribe some empirical research which was based on a 
preliminary formulation of the SFL. 

procedure 

This project required a good deal of developmental activity 
and evolved over a long period of time. The ste^ taken reflect 
this fact. 

Our preliminary thoughts on the SFL are included in paper I— 
’’The Basic Unit In Meaningful Learning— Association Or principle?” 
In this paper, we were concerned with describing the SFL and its 
relationship to the S-R Mediation Language. We presented a 
methodology for sissessing what is learned and (re) formulated 
a variety of research questions in terms of the SFL and the re- 
lated assessment methodology. 

\b our understanding of the problem deepened, we undertook 
the difficult task of refining and extending the SFL in paper II, 
’’Using the Principle to Formulate Research on Meaningful Learning.” 
We were able to do this by characterizing a principle as an ordered 
four-tuple, (I,D,0,R),where;(l) D referred to those stimulus 
properties which determine the responses, (2) 0, to the combining 
operation by which ( 3 ) the response properties (R) are derived 
from those in D, and ( 4 ) I, to those stimulus properties which 
determine when the rule, 0(D) -j^R, is to be applied. We also 
showed in this paper how the SFL aided the development of 
some of our research and how problems concerning reception and 
’ discovery learning, reversal and nonreversal shifts, Piagetian 
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conservation tasks, and symbolic and concrete learning can be 
reformulated in the SFL* 

At about this time, we felt the need for a commentary to 
researchers in mathematics and science education. In paper III, 
•'Precision in Research on Mathematics Learningj The Emerging 
Field of Psycho-Mathematics,” we attempted to provide in-depth 
analyses of certain questions of concern to all mathematics and 
science educators. We indicated how the SFL may help the researcher 
in his quest for new and important questions, and how it may be 
of even more help in formulating his questions in researchable 

forms. 

Paper IV, ”The Basic Unit In Meaningful Learning— Association 
Or Principle?” contrasts SFL and S-R formulations of several 
meaningful learning tasks. These tasks were shown to be easily 
represented in SFL terms and difficult, if not impossible, to 
formulate in the S-R language, vie showed how principles as well 
as associations and concepts can be represented in the SFL and 
indicated areas of SFL-based research either completed or underway 

during the past year. 

The final paper, "Concept Learning In Mathematics (Research in 
the Emerging Discipline of Psycho-Mathematics)” attempts to glean 
some of the highlights of both theory and empirical research 
based on the SFL. We also suggest future directions for psycho- 
mathematics and comment on what may result in a major refinement 

of the SFL. 

Results 

During the year that this project was in operation, a great 
deal was accomplished. It would be impossible to summarize all 
that was learned. In brief, we have devised and refined a new 
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scientific language (SFL) which provides for the description of 
psychologically relevant subject matter characteristics, .Ve have 
presented evidence in favor of using the principle, rather than 
the association, as the basic behavioral unit in meanxngful 
learning, k methodology, based on the SIX, was presented for assess- 
ing "What is learned," We also showed how the SFL and the 
related assessment methodology can be used to (re) formulate 
a variety of research questions (e.g. paired associate principle 
learning, reversal and nonreversal shifts, Piaget-type conserva- 
tion tasks, and symbolic and concrete learning). In addition, a 
number of empirical research studies based on SFL analyses were 
described (e.g. Scandura, Woodward, and Lee ( 196 ?), Greeno and 
Scandura ( 1966 ), Roughead and Scandura ( 1967 ) )• 1 “ 

Roughead and Scandura ( 196 ?) study, for example, the SFL analysis 
of what is involved in discovering mathematical rules indicated 
that discovery learners learn "something" by which they can 
derive solutions to an entire class of problems. We called this 
"something" h derivation rule. Using the SFL as a guide, we were 
able to identify that derivation rule taught in a guided discovery 
program used in an experiment by Gagne and Brown ( 1961 ) . 

Conclusions and Implications 

We have pointed out certain inadequacies in existing 
behavioristic theory as it relates to mathematics learning. 

An alternative scientific language (SFL) was described and it 
was shown how the SFL could be used to formulate research questions 
involving mathematical learning and performance. The SFL helps 
fill the gap between the highly controlled studies of the learning 
laboratory and the more encompassing but lees well specified 



research of the mathematics educator. 
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There are many problems that need to be solved if a useful 
theory of mathematical learning is to be invented. The SFL in 
its present form is not sufficient to deal with all of them. 

For example, no attempt has been made to extend this approach 
to deal with instruction which simultaneously involves several 
objectives. Second, complex problem solving and proving theorems, 
in which there are several stages, have non bcx-n considered. 

Third, the question of whether the SFL can be used to deal with 
highly complex forms of interrelated knowledge is still open 
and, finally, the SFL says little or nothing abouc memory or 
learning efficiency (i.e. time to learn). However, none of these 
problems seems insurmountable and we are presently looking into 
the possibility of extending the SFL in two ways: (1) by using 

it as a basis for instructional planning in mathematics, and 
(2) by looking into the very intriguing possibility of reformulat 
ing the SFL in terms of the more f imdamental , but abstruse, 
mathematical notion of a functor, .ilthough it is too early to 
say what may develop, we are very optimistic. 
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Background 

In spite of an increasing number of basic research studies, 
theoretical development in educational psychology has been ex- 
tremely slow. With few exceptions, the research on meaningful 
learning has been fragmentary. Results have often appeared to be 
inconsistent; similarities and essential differences have gone 
undetected. 

A major reason for this slow progress has been the lack of 
an appropriate scientific language with which to discuss educational 
research problens. Existing languages in psychology fail to 
capture the essence of meaningful learning and teaching. The 
choice to date has been between a precise, but seemingly inap- 
propriate S-R language, and presumably more relevant cognitive 
formulations which leave much to be desired insofar as scientific 

cohesiveness and rigor are concerned. 

At the time this project was initiated, preliminary steps 
had been taken toward the development of a new scientific language 
for formulating research questions on meaningful learning. 

Because the language was framed in terms of the mathematical 
notions of sets and functions, the name Set-Function Language 

(SFL) was adopted. 

Objectives 

The overall objective of this research was to extend, 
elaborate, and otherwise improve the SFL. We sought to determine 
its strengths and weaknesses by applying it to a variety of 
complex learning situations, particularly those involving mathe- 
matics. 

More specificsJ.ly , we proposed to : 



(1) contrast the SFL with the S-R language, 

(2) present a methodology for assessing what is learned, 

(3) (re) formulate a variety of research questions in terms 
of the SFL and the related assessment methodology, and 

(4) describe some empirical research which was based on a 
preliminary formulation of the SFL. 

procedure 

This project required a good deal of developmental activity 
eucid evolved over a long period of time* The ste)pB taken reflect 
this fact* 

Our preliminary thoughts on the SFL are included in paper I— 
»»The Basic Unit In Meaningful Learning— Association Or Principle?” 
In this paper, we were concerned with describing the SFL and its 
relationship to the S-R Mediation Language* We presented a 
methodology for assessing what is learned and (re) formulated 
a variety of research questions in terms of the SFL and the re- 
lated assessment methodology* 

As our understanding of the problem deepened, we undertook 
the difficult task of refining and extending the SFL in paper II, 
"Using the principle to Formulate Research on Meaningful Learning*” 
We were able to do this by characterizing a principle as an ordered 
four-tuple, (I,D,0,R) ,where; (1) D referred to those stimulus 
properties which determine the responses, (2) 0, to the combining 
operation by which (3) the response properties (R) are derived 
from those in D, and (4) I, to those stimulus properties which 
determ5.ne when the rule, 0(D) ^^R, is to be applied* V/e also 
showed in this paper how the SFL aided in the development of 
some of our research and how problems concerning reception and 
discovery learning, reversal and nonreversal shifts, Piagetian 
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con66rva.ti.on tasks f and symbolic and concrst© Isarning can be 
reformulated in the SFL* 

At about this time, we felt the need for a commentary to 
researchers in mathematics and science education* In paper III, 
'•precision in Research on Mathematics Learning! The Emeiging 
Field of psycho -Mathematics,” we attempted to provide in-depth 
analyses of certain questions of concern to all mathematics and 
science educators. We indicated how the SFL may help the researcher 
in his quest for new and important questions, and how it may be 
of even more help in formulating his questions in researchable 

forms. 

Paper IV, ”The Basic Unit In Meaningful Learning-Association 
Or Principle?” contrasts SFL and S-R formulations of several 
meaningful learning tasks. These tasks were shown to be easily 
represented in SFL terms and difficult, if not impossible, to 
formulate in the S-R lang^lage. -ie showed how principles as well 
as associations and concepts can be represented in the SFL and 
indicated areas of SFL-based research either completed or underway 

during the past year. 

The final paper, ”Concept Learning In Mathematics (Research in 
the Emerging Discipline of Psycho-Mathematics)” attempts to glean 
some of the highlights of both theory and empirical research 
based on the SFL* We also suggest future directions for psycho- 
mathematics and comment on what may result in a major refinement 

of the SFL. 

Results 

During the year that this project was in operation, a great 
deal was accomplished. It would be impossible to summarize all 
that was learnedo In brief, we have devised and refined a new 



scientific language (SFL) which provides for the description of 
psychologically relevant subject matter characteristics. We have 
presented evidence in favor of using the principle! rather than 
the association! as the basic behavn-oi al unix in meaningful 
learning* A methodology! based on the SFL! was presented for assess 
ing ’'What is learned*” We also showed how the SFL and the 
related assessment methodology can be used to (re) formulate 
a variety of research questions (e*g* paired associate principle 
learning! reversal and nonreversal shifts! Piaget*type conserva* 
tion tasks! and symbolic and concrete learning)* In addition! a 
number of empirical research studies based on SFL analyses were 
described (e.g* Scandura, Woodward, and Lee (1967)1 Greeno and 
Scandura (I966)! Roughead and Scandura ( 196 ?) )• In the 
Roughead and Scandura ( 196 ?) study! for example! the SFL analysis 
of what is iuvolved in discovering mathematical rules indicated 
that discovery learners learn "something" by which they can 
derive solutions to an entire class of problems* We called this 
"something" h derivation rule* Using the SFL as a guide, we were 
able to identify that derivation rule taught in a guided discovery 
program used in an experiment by Gagne and Brown (I96I)* 



Conclusions and Implications 

We have pointed out certain inadequacies in existing 
behavioristic theory as it relates to mathematics learning* 

An alternative scientific language (SFL) was described and it 
was shown how the SFL could be used to formulate research questions 
involving mathematical learning and performance* The SFL helps 
filX the gap between the highly controlled studies of the learning 
laboratory and the more encompassing but less well specified 
research of the mathematics educator* 
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Ihere are many problems that need to be solved if a useful 
theory of mathematical learning is to be invented. The SFL in 
its present form is not sufficient to deal with a,,.! of 
For example, no attempt has been made to extend this approach 
to deal with instruction which simultaneously involves several 
objectives. Second, complex problem solving and proving theorems, 
in which there are several stages, have not been considered. 

Third, the question of whether the SFL can be used to deal with 
highly complex forms of interrelated knowledge is still open 
and, finally, the SFL says little or nothing about memory or 
“T?a7SSi efficiency (i.e. time to learn). However, none of these 
problems seems insurmountable and we are presently looking into 
the possibility of extending the SFL in two ways: (1) by using 

it as a basis for instructional planning in mathematics, and 
(2) by looking into the very intriguing possibility of reformulat 
ing the SFL in terms of the more fundamental, but abstruse, 
mathematical notion of a functor. Although it is too early to 
say what may develop, we are very optimistic. 
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The number of fascinating and weighty 
questions concerned with mathematics learn- 
ing is certainly large, if not uncountable. 
As critically important as it is for the psycho- 
mathematician* to ask significant questions, 
however, that alone is not sufficient. Unless 
such questions are formulated so as to 
provide definitive information, the knowl- 
edge so acquired may be worse than no 
information at all— it may be misleading. 

Since this problem is of no small concern 
in research on mathematics learning, I shall, 
in this paper, attempt the perhaps overly 
ambitious task of providing in-depth analyses 
of certain questions of concern to all mathe- 
matics (and science) educators. A major 
result of these analyses is a scientific set- 
funetion language (SFL) that maybe of some 
help to the researcher in formulating his 
questions in researchable form. 

To aecomplish this, I shall describe some 
of my own research over the pjist few years, 
replete with blunders and insights. Con- 
sideration is given to: (i) the assessment of 
“what is learned,” (2) the basic behavior 
unit in mathematics learning, (8) rule gen- 
erality, (4) interpretability and symbolism, 
(5) concrete and symbolic learning, (6) 

* The name “psycho-mathematics” ("psycho- 
mathematician”) derives by analogy from psycho- 
linguistics (psycho-linguist) and suggests a dual 
concern with human behavior and the structural 
characteristics of mathematics— the study of the 
behavioral invariants of mathematical structure. 



cueing in discovery learning, (7) learning 
mathematics by exposition and discovery. 
Before I begin, however, let me give some 
perspeetive to what is being attempted by 
reviewing the role of basie research and 
theory generally, and the need for and 
nature of seientific languages. 

Introduction 

Background 

The central aim of all basic, information- 
oriented research is theory development. 
Sound theory, in turn, provides the basis 
for many inventions of use to mankind. 
Einstein’s Theory of Relativity provided 
mueh of the motivation for harnessing the 
atom, and cell theoiy is leading in the diree- 
tion of cancer prevention — so a sound 
theory of mathematics learning (and teaeh- 
ing) may make it possible to construet in- 
struetional procedures, efficient beyond pres- 
ent-day expectation. 

While there has been a good deal of action 
research in the classroom and carefully 
controlled comparisons of different in- 
structional sequences or eurricula, there 
has been little attempt to identify just what 
it is that makes one method or eurrieulum 
better or worse than another. To aehieve 
a better understanding of how mathematics 
learning takes place, not only must the 
research be sound in design, but the right 
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questions must be asked and properly 
formulated.* 

I shall concentrate hem on the identifica- 
tion and formulation of significant questions 
involving mathematics learning. Let me 
be clear as to just what is intended. The 
concern is not simply with the identification 
of problem areas (e.g., the need for a better 
understanding of how to sequence mathe- 
matics instruction, the role of symbolism 
in mathematics learning, how slow learners 
learn mathematics) as important as that 
might be. The mathematics educator, rais- 
ing such questions, is much like the nuclear 
technician saying that we need to know more 
about the particles comprising atomic nuclei 
before we can successfully harness the power 
of the H-bomb. Neither is my task that 
of simply arguing for the clear and unam- 
biguous specification of experimental vari- 
ables. That is a necessary but not suf- 
ficient condition. 

What I would like to do is consider how 
the psycho-mathematician is led to raise 
significant new questions, and once raised, 
how he goes about formulating them in 
researchable form. Unfortunately, I know 
of no complete answer to either of these 
questions. Furthermore, no a 'priori guar- 
antee or procedure can ever be prescribed 
to insure that the scientist will not end up 
a blind alley. (This is probably a good 
thing since so many seemingly blind alleys 
have led to some of the biggest scientific 
breakthroughs.) Especially in the less de- 
veloped sciences, of which psycho-mathe- 
matics would be one, most research ques- 
tions are based solely on intuition. The 
adequacy of both their identification and 

* This paper is concerned only incidentally with 
questions of design. Competence in this area is 
assumed (not always a feasible assumption as regards 
educational research). Once his problem has been 
identified, the technically competent .scientist must 
be able to set up, conduct, and analyze his data .so 
as to answer the question initially posed. Without 
such comj)etence the scientist, whether physicist, 
chemist, psychologist, or p.sycho-mathematician, is 
like the mathematician who cannot prove routine 
theorems. It is his «me qua non. 



Specification are dependent on the perspicuity 
of the scientist’s intuition. 

Nonetheless, there are more formal means 
by which the scientist may be led to ask 
new questions. One of these is theory. 
Manipulations within the system, itself, 
may lead to new propositions ready for 
empirical test. The advantage of generat- 
ing research questions on the basis of theory, 
rather than by sheer intuition, is that the 
experimental results tend to either sub- 
stantiate or refute not just the question 
at hand but a whole set of interrelated 
propositions about the real world. Where 
such theory exists, a theoretical approach 
is certainly to be recommended, for economic 
as well as aesthetic reasons. There are 
simply too many questions that might other- 
wise be asked — and research is expensive 
of both time and money. As suggested 
above, however, the scientist must be 
constantly aware of the danger of restricting 
his concern to one theory with little atten- 
tion being given to the real world. He 
might well end up with a highly elegant 
theory that says nothing in particular 
about anything important. 

In any case, there has been very little 
theory based research on mathematics 
learning for the simple reason that there 
are few relevant theories. Hopefully for 
the future some highly promising work is 
undei^vay.* 

Rather than well-formulated deductive 
theories, however, the scientist may be led 
to ask new research questions by making use 
of scientific languages or other incomplete 
models. A scientific language is just what 
the name implies, a symbolic language with 
which to talk about scientific questions (e.g., 
to formulate research problems). 

As with theory, there are certain desirable 
characteristics of scientific languages. A 
language should, if it is to be an improvement 
over the native language, be concise with the 
constituent symbols having a precise mean- 
ing. Mathematics makes an ideal language 
in this sense. The primary requirement, 
however, is that the language accurately 
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represent the phenomena in question. With- 
out such fidelity the language can have no 
real value. The ideal language is one which 
strips an empirical situation of non-essentials 
while accurately and precisely retaining 
that which is central — a process much akin 
to abstraction. While the mathematician 
asks, “Why restrict one’s attention to 
euclidian spaces when any metric space 
will do?”, the psycho-mathematician asks 
such questions as “Why restrict one’s at- 
tention to the effects of familiarity, with the 
concepts of limit and absolute value, on 
learning the epsilon-delta definition of a 
limit when a large number of behaviorally 
similar situations exist in mathematics 
learning?” 

Although it may provide a rigorous basis 
for analyzing empirical situations, however, 
a scientific language may have only indirect 
relevance to any particular set of theoretical 
postulates. This is at once the major ad- 
vantage and the major limitation of a 
scientific language. It is an advantage 
in that no commitment is made to any 
one theory although usually some type of 
theory is implied. Thus, while the S-R 
language is used almost exclusively by 
contemporary experimental psychologists, 
the S-R theories employed vary from highly 
structured Hullian theory and Guthrie- 
Estes-type stochastic models to the use of 
relatively simple S-R diagrams with a 
minimal list of ad hoc assumptions. The 
major limitation of a scientific language is 
that empirical predictions cannot be formally 
derived without the addition of theoretical 
assumptions. 

Inadequacy of Existing Languages 

Granting the potential value of a language 
in promoting significant research on mathe- 
matics learning, the important question is, 
“What language?” As indicated above, 
an entire school of psychologists has found 
extremely useful the so-called S-R language, 
a language which, for the most part, has 
derived most of its basic concepts from 
animal experimentation. Is this molecular 



associationist language suitable for use in 
describing mathematics learning? Surely, 

I am not the first to think not.®* 

Other frameworks within which to treat 
mathematics learning, teaching, and/or 
transfer have been advanced at a more molar 
level, but they have been intended as a means 
for organizing literature reviews and not as 
a basis for making the fine discriminations 
necessary in formulating research on mathe- 
matics learning. Henderson® for example, 
viewed mathematics teaching as an ordered 
triple, teacher X teaching subject or topic 
Y to student(s) Z. Similarly, Becker and 
McLeod® have proposed a basis for char- 
acterizing transfer situations in mathematics. 

A most important outcome of recent col- 
laborative efforts between mathematics edu- 
cators and psychologists has been to focus 
attention on the close relationships between 
task and method variables. Mathematics 
educators and psychologists have increas- 
ingly come to realize that research on mathe- 
matics learning and teaching must, on the 
one hand, deal with mathematical structure, 
and, on the other hand, with observables 
(i.e., behavior). Little of scientific and 
practical value to education can be ac- 
complished by talking about either structure- 
free (i.e., rote) materials or unobservable 
mental processes. 

There has been a resulting recognition 
of the need for a precise language, couched 
in observables, which also provides for 
the description of psychologically relevant 
mathematical characteristics. To be useful, 
such a language should, at a minimum, make 
it possible to represent existing learning, 
what information is presented and how, 
criterion tasks, and structural relationships 
between them. 

How does one go about devising a scientific 
language? Here again no one answer can 
be given. What I shall describe in the 
following .sections is the chain of events 

* Further discussion of this and subsequent topics 
where indicated is available by corresponding with 
the author; he will welcome such commiuucations. 
Editor. 
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which led me to the set-function language 
(SFL) there introduced. My approach was, 
in part, haphazard and, in part, based on an 
analysis of just what is involved in certain 
aspects of mathematics learning (I clearly 
have not attempted to deal with all that is 
relevant). Where appropriate and possible 
the questions considered are represented 
symbolically, thereby exposing the basic 
structure of the empirical situation involved. 
Happily, it turns out that these critically 
important characteristics can be formulated 
precisely, largely in terms of the set and 
function concepts of mathematics. 

Before proceeding let me emphasize that 
I do not pretend to have the final word, but 
only what I feel is a step in the right direc- 
tion. 

The SFL and Research on Mathematics 
Learning 

What Is Learned and Response Consistency 

If, as has been proposed,^'^®-^^'^^ the cur- 
rent state of the learner is critical in deter- 
mining his response to a new stimulus, and, 
indeed, in determining his future learning, 
the assessment of “what is known” becomes 
a central task in psycho-mathematics. 

To see what is involved, consider the 
situation depicted in Figure 1. Suppose an 
experimental subject {S) is required to learn 
to say the indicated words when shown the 
learning stimuli (e-g-) to say “black,” 
when shown the large black triangle). 
After the four training S-R pairs aremastered. 



SO that the subject 


can reliably give the 


LEARNING 
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Fig 1. Sample learning, assessment (test one), and 
prediction (test two) stimuli and re.sponses. 



correct response to each stimulus, the ques- 
tion remains as to just what was learned. 
Did the subject learn four discrete pairs 
(i.e., associations), noticing no relation- 
ships between them? Or, did he learn 
the two principles, “If triangle, then color.” 
and “If circle, then size.”?* 

This question first began to bother me 
while working with Jim Greeno at Indiana 
University during the summer of 1962. 
In a study designed by Greeno,® we found, 
in a verbal concept learning situation, that 
essentially S either gives the correct response 
the first time he sees a transfer stimulus or 
the transfer item is learned in the same way 
as its control. 

The thought later occurred to me that if 
transfer obtains on the first trial, if at all, 
then responses to additional transfer items, 
at least under certain conditions, should be 
contingent on the response given to the 
first transfer stimulus. In effect, a first 
transfer stimulus could serve as a test to 
determine what had been learned during 
stage one, thereby making it possible to 
predict what response S would give to a 
second transfer stimulus. 

To test this assumption, I had a number 
of small groups of college graduates, a total 
of about fifteen Ss, overlearn a list similar 
to that shown in Figure 1. Prior to learning 
the lists, both the <Ss and the experimenter 
(E) agreed on the relevant values and di- 
mensions — size (large-small), color (black- 
white), and shape (circle-triangle). The 
iSs were told to learn the pairs as efficiently 
as they could since this might make it pos- 
sible for them to respond appropriately to the 
transfer stimuli. After learning, the Test 
One Stimuli were presented and the 5s 
were told to respond on the basis of what 
they had just learned. Positive reinforce- 
ment was given no matter what the response 
(i.e., the 5s were told they were correct). 
The Test Two Stimuli, then, were presented 
in the same manner. 

* Notice that both statements have the form, “If 
r, then R'.” The reason for using primes will be- 
come clear in a later section (Further Analyses). 



PKYCHO-MATHEMATICS 



257 



The results were clear-cut. All but three 
of these ^s gave the responses “black” 
and “large” respectively to the two Test 
One Stimuli (see Fig. 1), and also responded 
with “white” a iO, “small” to the Test Two 
Stimuli. On what basis could this happen? 
It was surely not a simple case of stimulus 
generalization;* the responses did not de- 
pend solely on common stimulus properties. 
The first Test One Stimulus, for example, 
is as much like the fourth learning stimulus 
as the first (see Fig. 1). 

Perhaps the simplest interpretation of the 
obtained results is that most of the ^s dis- 
covered the two underlying principles while 
learning the original list and later applied 
them to the test stimuli. In effect, the 
common relationships between the S-R 
pairs when combined with a response con- 
sistency hypothesis (i.e., when S “thinks” 
he’s right and the new situation is relevant, 
he will continue to respond in a similar 
manner) provided a basis for assessing “what 
was learned.” t 

Basic Unit of Behavior— Association 
or Principle 

In order to construct a precise descriptive 
language which adequately reflects mathe- 
matics learning, a basic behavior unit must 
be selected. The history of science has 
shown that the hypothesis-generating and 
predictive value of any theory or scientific 
language is determined in large part by the 
appropriateness of its basic building blocks. 

In S-R psychology this basic building 
block is the association, a learned connection 

* Stimulus generalization refers to the fact that 
there is a positive probability that a new stimulus, 
At, will elicit response B whenever At is similar to Ai 
and Ai already is known to elicit B. 

t In effect, this assessment procedure capitalizes 
on the Einstellmg (mental set) phenomena, first 
discovered by Luchins.® The fact that adult Ss tend 
to encode via dimensions (e.g., color) rather than 
stimulus values (e.g., black), however, may be at 
least partially original. 

As will be seen in the following study on llule 
Generality, response consistency also obtaiiu, with 
actual mathematical rules. 



between an observable stimulus (e.g., light, 
nonsense syllable, or mathematical problem) 
and an observable consequence or response 
(e.g., salivation, another nonsense syllable, 
or solution). A connection or association is 
said to be formed if the response appears 
with a positive probability whenever the 
corresponding stimulus is presented. 

Learning a concept, presumably a more 
complex form of learning, involves the ability 
to give a common response to any one of a 
set of stimuli. To say that S understands 
the concept of “circle,” for example, implies 
that S is able to say the name “circle,” 
the common response, when shown any 
example of a circle but will not say “circle” 
to any noncircle stimulus. 

In effect, v/hereas an association is a 1-1 
relationship, a concept is a many-1 relation- 
ship. Since 1-1 relationships are felt to be 
basic, the S-R psychologist has felt obliged 
to represent the many-l concept relation- 
ship as a composite of 1-1 relationships. 
This was made possible by the postulation of 
mediating links (associations). Thus, the 
many-1 relationship can be viewed as 




where Si, Sa, and S3 are stimuli connected 
to the mediating response Mrs whose stimulus 
properties, in turn, elicit R.* 

A still more complex form of learning is 
the principle. Knowing a principle makes 
it possible to give the appropriate response 
in a class of responses to any one of a class 
of stimuli. Principles are, thus, many- 
many relationships and operate somewhat 
as indicated by a statement of the form, 
“If A, then B.”^ Examples of principles 
in mathematics are easy to come by. The 

* III this display, Mrs refers to both the mediating 
response and its stimulus properties. Whereas S-R 
psychologists have made use of this distinction, no 
purpose is served in doing so here. 
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principle (statement), “If given a quadratic 
equation (of the form x- Bx C = 0), 
then the two roots, n and r-> may be ol)tained 
by solving the equation {—B±\^B- — 4C)/ 
2,” for example, makes it possible (assuming 
the necessary manipulative skills) to find the 
roots of any quadratic equation. 

Becau.se of their obvious relevance to 
mathematics (and other meaningful) learn- 
ing, it is perhaps surprising to the mathe- 
matics and science educator that psycholo- 
gists have done relatively little research 
on principle learning. In view of the pre- 
occupation with {issociations, however, it is 
perhaps not so surprising after all No 
one has made a serious attempt to char- 
acterize principles in terms of associations.* 
In attempting to see if it could be done, 
I came up with what appears to be an ac- 
curate representation. The result,® how- 
ever, is far from aesthetic. In fact, it is 
downright cumbersome— still more reason 
for psychologists to have almost completely 
avoided the study of principles. 

When the observable, and therefore scien- 
tifically relevant, aspects of the association, 
concept, and principle are formulated mathe- 
matically, the result, in each case, is simply 
a set of ordered stimulus-response pairs. 
A learned association is denoted by only 
oiie such pair, a concept by a set of pairs 
with a common response, and a principle 
by an arbitrary fudclion (i.e., a set of or- 
dered pairs, such that each stimulus cor- 
responds to one and only one response). 
When looked at in this way, the principle 
becomes the basic unit while the concept and 
association become special cases. 

The psycho-mathematician is, thus, faced 
with a dilemma: (/) maintain the associa- 
tion as the basic unit, along with a large 
body of existing psychological data, and 
have a cumbersome tool (language) with 
which to study mathematics learning, (2) 

* Gagtie^'* has advaticed a idrm« .lation of a priti- 
ciple in terms of concepts. While this represent.- tion 
is operational (i.e., usefun, it is questionable wh,u;er 
the association is the basic building block. 



adopt the principle and thereby base his 
research on a relatively unproven behavioral » 
unit, or (3) formuhite his research questions 
on a strictly intuitive, and relatively im- 
precise, basis. Although arguments can 
be made for each approach, the present 
paper is concerned only with the second. 

Principle Generality and Consistency 
in Mathematics Learning 

How general (abstract) should the presen- 
tation of mathematics be? Should addition 
and subtraction be taught as two distinct, 
although related, operations, as has been 
done traditionally, or as one operation as in 
more modern treatments? Should the three 
caso.s of percentage be taught separately 
or as variants of the principle, “base times 
rate equals percentage?” Should pupils 
be taught the method of “casting out nines” 
or be taught the more general principles of 
modular arithmetic? How generally should 
theorems be stated? Proofs? 

Mathematics educators have had to 
concern themselves with such problems, 
but they have had only intuition to guide • 
their judgment. There is a real need to 
better understand the psychological prin- 
ciples involved. Unfortunately, however, 
previous studies involving rule and principle 
learning have dealt only incidentally with 
the question of generality. 

Assuming that the answers hinge, at least 
in part, on learnability as well as general 
utility and armed with the denotative char- 
acterization of a principle as a function. 
Woodward, Lee and P® set out to explore 
this question. In particular, we were con- 
cerned with the effects of principle generality 
on learnability and transfer. We also ex- 
plored the response consistency hypothesis 
with more complex materials. Two ex- 
periments were conducted, the independent 
variable in both cases being the scope (i.e., 
generality) of a principal statement. Scope 
was defined in terms of the corresponding 
denotation, one statement being more general 
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than another if the denotation of the former 
included the latter.* 

Our original hypothe.ses were that: (1) 
the scope of a principle would be fully re- 
flected in performance, there would be 
little success on extra-scope problems and 
no differences in performance on within- 
scope problems, (2) the learnability of a 
statement, as determined by within-.scope 
performance, would vary inversely with 
scope, and (3) the combining rule taught 
would be used on all problems even when, 
after proposing a problem solution, S was 
not told whether he was right or wrong 
(i.e., under conditions of nonreinforcement). 

Experiment One. In the first experiment, 
each group of 17 college Ss (majors in ele- 
mentary education) was presented with one 
of three ordered principles dealing with a 
variant of the number game called 
In the game, two players alternately select 
numbers from a specified set of consecutive 
integers, beginning with one, and keep a 
running sum. The winner is the one who 
picks the last number in a series with a pre- 
determined sum. If, for example, this sum 
is 31 and the set consists of the integers 
1-6 the players alternatively select numbers 
from 1-6 until the cumulative sum is either 
31 or above (in which case no one wins). 
There are rules which allow the player who 
goes first to win. 

Any such game can be characterized by 
an ordered pair of integers. The application 
of each winning principle was illustrated with 
a common (6, 31) game. The least general 
principle (S), adequate for winning only 
(6, 31) games, was stated, “ . . . make 3 
your first selection. Then . . . make oolec- 
tions so that the sums corresponding to your 
selections differ by 7.” Principle (SG) 

* Notice that defining a principle as a function 
makes it possible to consider a variety of other rela- 
tionships between different principles. In particular, 
two functions (i.e., .sets) may be disjoint (i.e., have 
no instances in common), overlap, or be identical in 
addition to being ordered (i.e., one being more 
general than the other). Such relationships might 
provide a basis for formalizing quest ions concerning 
the effects of prior learning on latci- learning. 



was adequate for solving (6, j) games j = 1, 

2 , .. .,71 and was stated, “the first selection 
is determined by dividing the desired sum 
by 7 and making the remainder your first 
selection. . . .Then . . . make .selections .so 
that the sums corresponding to your selec- 
tions differ by 7.” The most general principle 
(G) was adequate for solving (f, j) games i 
= 1, 2, . . . , m; j = 1,2, ...,n and was 
stated, “the first selection i;. determined by 
dividing the desired sum by one more than 
the largest integer in the set from which 
the selections must come and making the 
remainder your first selection. . . . Then 
. . . make selections so that the sums cor- 
responding to your selections differ by one 
more than the largest integer in the set.” 

All Ss, including two control groups, were 
tested on three problems. The first was 
within the scope of each principle, the second 
within the scope of all but principle S, 
and the third only within the scope of prin- 
ciple G. 

The results were straightforward.* Of 
those 13 Ss in group S who solved problem 
one, none solved problem two, and only 
one solved problem three. The correspond- 
ing numbers for groups SG and G were, 
respectively, 5, 4, 0 and 5, 5, 4. Within 
the scope of each principle there were only 
chance differences in performance on the 
problems. On the other hand, only one S 
solved an extra-scope problem. 

The relative interpretability of the three 
rule statements was determined by com- 
paring group performance on problem one 
which was within the scope of each. Rule 
S proved to be easier to learn, under the 
self-paced conditions, than were the rules 
SG and G {p <0.007 in both cases). There 
was, however, no discernable difference in 
the interpretability of rules SG and G. 

The third facet of this research was con- 
cerned with the consistency with which 
presented principles are applied. W e wanted 
to determine whether the S and SG Ss 
would use the rule taught even when it was 

* Exact probability tests on 2 X 2 contingency 
tables were lused to test the various hypotheses. 
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inappropriate (on the second and third 
problems). To make this possible, no 
information was given as to when the prin- 
ciples were and were not appropriate. 

Of the 17 S iSs, 13, 9, and 8 u.sed the rule 
taught on problems one, two, and three, 
respectively. The corresponding numbers 
in groups SG and G were 7,7, and 5 and 6, 

6, and 6. Although there was a slight tend- 
ency to not use the rules taught on problems 
two and/or three, where they were in- 
appropriate, there were no significant dif- 
ferences in frequency of use. 

These results certainly provided strong 
support for our original hypotheses: {1) 
performance on within-scope problems did 
not differ appreciably, even though the 
common illustration was more similar to 
problem one than the others, and successful 
problem solving was limited almost ex- 
clusively to within-scope problems, (^) 
rule S proved easier to interpret than rules 
SG and G, and {3) the rules taught tended 
to be used consistently on all problems 
whether they were appropriate or not.* 

About the only major unanticipated re- 
sult in experiment one was that rule G 
proved as easy to interpret as rule SG. 
In view of the rather low proportion of 
successes in these groups, we were originally 
tempted to attribute the lack of such an 

* The first mentioned result has particular rele- 
vance for the psychologist since it crystallizes the 
fact that no generalization gradient is to be expected 
when the stimulus values are discrete rather than 
based on a continuous physical dimension. If there 
were such a gradient, performance on the first test 
problem, which was more similar to the example, 
should have been superior to that on the other prob- 
lems. Even S-R associationists are generally 
agreed that the lack of such an effect provides in- 
direct support for a rule or principle interpretation. 
To the extent that the variables involved in mean- 
ingful learning are discrete, a rule interpretation may 
prove more useful. 

Furthermore, when the underlying stimulus di- 
mension(s) are continuous, S-R theorists will need to 
consider the possibility that generalization gradients 
are simply artifacts of averaging individual differ- 
ences in perceptual discrimination (and, hence, what 
rule is Icarneo ) over continuous dimensions. 



effect to scale insensitivity near its lower 
extreme. 

Experiment Two. To determine the gen- 
erality of these findings, a second experi- 
ment, dealing with arithmetic series, was 
conducted with junior high school iSs. In 
this experiment, both scope (S, SG, G) and 
example (present, absent) were varied 
independently. Since most of the <Ss were 
already familiar with the arithmetic opera- 
tions introduced and, to some extent, with 
number series generally (i.e., as in adding 
i; .ts), it was felt that examples might provide 
a basis for generalization, via discovery, to 
extra-scope problems. Another difference 
between this experiment and the first was 
that rule S, 50 X 50 (= 2500), was effec- 
tively an answer given treatment and applied 
to only one series. This series was used 
both as the common example and as prob- 
lem one. In experiment one, rule S applied 
to a number of different game sequences. 

Although the pattern of results shown in 
Table I paralleled those of experiment one 
in most respects, there were several im- 
portant differences. First, the presence 
of the example (problem one) along with 
rule S resulted in significantly better per- 
formance on problem two than when rule S 
was shown alone, the only case in either ex- 
periment where nonnegligible success was 
noted on an extra-scope problem. This 
effect may have been due to the form of the 
combining operation, “50 X 50,” in the rule 
S statement. “50 X 50” is clearly an 
instance of the more general SG combining 
rule, “n X w = n^.” Presumably, the 
statement of rule S, together with the com- 
mon illustrative series, 1 -f 3 -f 5 -f . . . + 
97 -f 99, provided the successful S iSs with 
enough cues to generalize. In particular, 
they may have discovered that this series 
had 50 terms. Hindsight suggests that this 
difficulty might have been overcome by 
simply stating the sum, 2500, of the il- 
lustrative series rather than “50 X 50.” 
Second, Table I indicates that only three 
of the 19 G-with-example iSs solved problem 
three whereas 18 solved problem one and 
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TABLE I 

Sun mary: Number of Correct Sums (Uses of Rule Taught) on Problems One, Two, and Three 

Example 



N 



one 



two 



three 



N 



one 



two 



three 



Group S 


20 


8(8) 


1(1) 


2(1) 


Group SG 


20 


5 (5) 


4 (5) 


0(5) 


Group G 


19 


3(5) 


5(7) 


2(7) 



21 


20 (20) 


9(0) 


15 


11 (12) 


8(9) 


19 


18 (18) 


14 (16) 



3(0) 

1 ( 6 ) 

3(15) 



14 solved problem two. The decrement 
between problems two and three was sig- 
nificant ip <0.004). The reason for this 
difference was not immediately apparent 
especially since 15 of these >Ss applied rule 
G to the third problem. A more intensive 
post hoc analysis of the situation, however, 
suggested that the result may have been 
due to a difference in ease of determining JV, 
the number of terms, for use in the G com- 
bining rule, [(A + L)/2]N. N could be 
determined from problem series one and two 
by taking the average of the first and last 
terms. A careful examination of the test 
papers suggested that this led to the in- 
correct value (25, rather than 24) for JV 
in the third series, 2-}-4-l-6+... + 46 
-}- 48. In short, the difficulty was not in 
the rule but in finding the correct value of 
JV. Such difficulties may be circumvented 
in future experimentation by controlling 
for such unwanted differences.* 

Third, although the results of experiment 
two were in the hypothesized direction, only 
the overall effect of scope on interpretability 
was significant. This led us to wonder 
whether interpretability of the principle 
statements depended solely on generality. 
Could the principle statements have also 
differed as to the difficulty of interpreting 
the actual terms or symbols used? After 
consideration of this possibility, interpret- 
ability was rejected as an important factor 
in e..pcriment two since a recheck convinced 
us that we had succeeded reasonably well 

* It may be desirable to think of properties, such 
as iV, as being derived from lower order (l.e., more 
easily discernible) stlmulns proi)ertles. Thus, the 
rule, (A + L)/2, worked for problems one and two 
whereas 1/2 was recjulred for problem three. 



in stating each principle as clearly as pos- 
sible. Perhaps a more likely interpretation 
is that the >Ss’ familiarity with arithmetic 
interacted with the materials used so as to re- 
duce the effects of statement generality. 

Fourth, only one of the >Ss who was shown 
the rule, 50 X 50, applied it to problems 
two and three. This result can probably 
be attributed to an interfering effect due 
to prior familiarity with addition problems. 
The >Ss may simply have mistrusted rule 
S. How could a rule like 50 X 50, having 
only one answei> be the sum of all three 
problem series? Most junior high school 
>Ss would find it unreasonable that the 
series 1 -}- 3 + . . . + 99 (problem one) 
and 1 -b 3 + . . . + 79 (problem two) have 
the same sum (50 X 50). Some such 
reluctance may also have obtained on prob- 
lem one with group S-without-example. 
Nonetheless, we were surprised that only 
8 of those 20 >Ss, not presented with the 
illustrative series, gave the correct sum 
(2500 or 50 X 50) for problem one. 

Implications and Theoretical Comment. 
The results of these experiments demon- 
strate, in a rather conclusive fashion, the 
behavioral relevance of principle generality. 
For the most part, successful performance 
was noted only on tasks within the scope of 
verbally stated principles. When prin- 
ciples are presented in an expository fashion, 
it is normally too much to expect generaliza- 
tion to problems to which the principle does 
immediately apply. t 

Of perhaps even greater practical sig- 
nificance were the la(ik (there was one excep- 

t We have an exi)eriment underway which v;e 
hope will help to Identify the conditions under which 
extra-scoi)e generalization may be expected. 
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tion) of performance differences on within 
seope problems and the consistency results. 
The former result demonstrates that (ahno.st) 
any stimulus within the scope of a principle 
is equally as difficult to re.spond to correctly 
as any other. Furthermore, coupled with 
the eonsisteney data cited above, the ob- 
tained eonsisteney results suggest that 
only one (new) te.st .stimulus is needed to 
determine whether, in fact, a given prin- 
ciple has been learned. No more informa- 
tion is gained by using additional test in- 
stanees. These results could have far-reach- 
ing implications for the development of highly 
efficient measuring instruments. 

In addition, the pronounced tendency of 
the Ss to attack all of the test problems in 
the same way, irrespective of whether the 
procedure used was appropriate, suggests 
that the ability (i.e., knowing how) to solve 
problems and knowing when to make use of 
this ability to solve problems are quite 
distinct. Testing for the latter ability 
necessarily must involve the presentation 
of extra-scope problems. 

Perhaps even more important than the 
results of these exploratory experiments 
were the post hoc analyses they made both 
necessary and possible. In particular, the 
preceding discussion strongly suggests that 
the roles played by various aspects of a 
principle statement need to be more clearly 
specified. The form, “If F, then R'” 
does not detail all that appears relevant. 
For one thing, it was not possible in the rule 
generality study to distinguish between the 
roles played hy A, L, and N (where A, L, 
and N have a particular meaning) and the 
algebraic expression, [(X + Y)/2]Z (where 
the variables have general relevance) . The 
former variables relate to properties of the 
series stimuli, while the latter is a ternary 
operation by which another such property 
(e.g., sums) may be derived. I', of course, 
although it played no role in the rule gen- 
erality study is also critical. It tells when, 
in fact, a rule can and cannot be applied. 
Thus, the rule, N^, is appropriate whenever 
an arithmetic series con.sists of the consecu- 



tive odd integers beginning with 1 while 
[(yl -f L)/2]N works whenever there is a 
eommon difference between adjacent terms, 
an arithmetic series eonsists of the con- 
secutive odd integers beginning with 1 while 
[(A + L)/2]N works whenever there is a 
common difference betw'cen adjacent terms. 

These observations suggest that, a principle 
statement may be represented more appro- 
priately by the form, “// I', then O' (D') 
= R',” where 1' refers to the set of stimulus 
properties which indicate when the rule, 
denoted O' (D'), should be applied, D' refers 
to the set of those properties which determine 
the responses, and O', the operation from which 
the responses, denoted by R', may be derived 
from the properties referred toby D'. 

Similarly, a principle, that internalized 
repiccentation which determines a learner's 
responses to stimuli, may be characterized hy 
an ordered four-tuple (I, D, 0, R). Primes 
may be used (as has already been done im- 
plicitly) to distinguish between referents {e.g., 0) 
and the symbols used to represent them {e.g., 
O', 0"). 

Although the actual symbols used in a 
statement may be an important factor as 
has been suggested above and as will be 
demonstrated in the next section, the hy- 
pothesis advanced in the rule generality 
study to the effect that scope and learnability 
are inversely related finds a formal rationale 
in the nature of the characterizing elements. 
]\Iaking operational use, for example, of the 
arithmetic series property (i.e., dimension), 
“the difference between adjacent terms is 
some common value,” necessarily presumes 
that, “the difference between adjacent 
terms is two," “ . . . three," “etc.,” can all 
be correctly interpreted. The converse 
does not necessarily follow. A similar 
relationship exists with respect to the rules, 
50 X 50 and N X N. To correctly apply 
the latter, more general, rule to any par- 
ticular series requires the ability to deter- 
mine any value of the dimension N, in- 
cluding 50. Being able to apply 50 X 50, 
how’cver, does not. 

It w'ould appear that the more general 
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the principle the more is expected of the 
learner. Whether such differences will be 
reflected in behavior, however, may depend 
on not only rule generality but the popula- 
tion involved, particularly on whether the 
*Ss have the necessary requisite abilities. 

In effect, differences in generality appear, 
on analysis, to be equivalent to differences 
in abstraction level. Thus, the number two 
is more abstract than the property iivo 
oranges because the former applies to a 
collection of sets only one (subcollection) of 
which has the latter property. For the 
same reason, the property represented by the 
place holder X is more abstract* than the 
number two since it refers to a still higher 
order collection. Unfortunately, we have 
not yet conducted a study designed to pro- 
vide definitive information on the.se points. 
For the present, this analysis remains hypo- 
thetical. 

Interpretahility and Symbolism 

To help clarify the role symbolism plays in 
mathematics learning, I recently completed 
a study’® with the help of John Davis in 
which we varied both the symbols actually 
used to construct statements of a principle 
and the ability of an S to interpret these 
symbols. It seems almost axiomatic that the 
ability to interpret a statement of principle 
depends critically on the ability to interpret 
the symbols of which it is composed,’^ be 
they mathematical symbols or elements of 
the native language (e.g., English). None- 
theless, in mathematics learning the use of 
mathematical symbolism is frequently, if 
not always, preferred to ordinary English. 
The reason why, however, is never made 
explicit. 

The purpose of this study was to deter- 
mine whether: (1) principles are more 

easily memorized when stated symbolically 
or when stated verbally* and {^) the ability 
to correctly use constituent symbols and 
the required (grammatical) combining rules 

* See footnote page 2“)"). Editor. 



is a necessary and/or sufficient condition 
for applying a learned (i.e., memorized) 
principle statement. 

Method. Four artificial principles, each 
unfamiliar to the 24 *Ss (college majors in 
elementary educa Ion), were selected for 
study. Each principle was based on one 
of the following notions: greate.st integer, 
sigma notation for sequential addition, 
vector, and partial derivative. Two state- 
ments of each principle were prepared; one 
was composed of unfamiliar mathematical 
symbolism and the other of carefully, yet 
succinctly, worded English. For example, 
the greatest integer rule was stated (in 
English), 

1 . Take the greatest integer in X. 

2. Take the greatest integer in Y. 

3. Divide the result of step one by the 
result of step two. 

4 . Take the greatest integer in the quo- 
tient obtained in step three. 

The symbolic form of this rule was [([X] 

- [Y])]. 

Tasks also were designed to train the *Ss 
to interpret the constituent symbols. For 
example, one set of tasks involved the 
greatest integer function (i.e., | (W, [W])|all 
real W}). In addition, all of the *Ss were 
required to demonstrate proficiency in the 
use of parentheses as a means of signifying 
the order in which binary operations are 
to be taken. These conventional rules of 
grammar were involved in all four principles. 
Of course, neutral materials were used to 
teach and assess proficiency in the use of 
parentheses. In no case did the pretraining 
or assessment include either a complete 
rule or one of its instances. 

A 2 X 2 factorial design, with repeated 
measures, was used. Each S effectively 
served as his own control. One factor was 
the form in which a given principle was 
stated, symbolic or English. The other 
factor involved the presence or absence 
of training on the constituent symbols. Of 
course, the principles were counterbalanced 
over treatments so that each was used 
equally often under each of the four treat- 
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inents. All other unwanted factors were 
randomized, including presentation order. 

Separate measures of learning rate and 
interpretability were obtained. Learning 
rate was determined by presenting each 
principle for a fixed period of time for study 
and testing to see if the Sa could completely 
reproduce them in written form. All four 
principle statements were shown once before 
the next go-through (trial) began. Ease 
of learning was determined by the number 
of trials it took to learn each principle to a 
criterion of two perfect reproductions in a 
row. 

Interpretability was measured imme- 
diately after all of the statements had been 
well-learned. To demonstrat his “under- 
standing” of the statements, ^S' was required 
to apply each of the corresponding (under- 
lying) principles to two stimulus instances. 
For example, one of the problems used to 
determine whether S could apply the integer 
rule was stated simply, “If x =■- 8.64 and 
y = 3.24 then . . All four principles 
were tested once before the second set of 
test problems was given. 

Results. The results demonstrated quite 
clearly that: {1) symbolic rules are learned 
more rapidly, whether the constituent 
symbols are familiar or not (p <0.01)— there 
were only 2 exceptions (out of 24) to this 
generalization, (2) rules, stated in symbolic 
form, are applied successfully if and only 
if the *Ss have been taught how to apply the 
constituent symbols (and the necessary 
grammatical rules) — there were only 4 
exceptions to the sufficiency part of this 
generalization and none as regards necessity, 
(S) rules, stated in the native English 
language, are applied equally well whether 
or not training in the use of the correspond- 
ing mathematical symbols is given, and (4) 
English statements, once learned, are applied 
equally as well (in this study, somewhat 
better) as symbolic statements in which 
use of the constituent symbols has previously 
been mastered. 

These results are not entirely surprising 
but they do, nonetheless, make explicit 



at least one aspect of the role .symbolLsm 
plays in mathematics learning. The use 
of symbolism makes mathematics learning 
more effi(;ient when the constitr it symbols 
and grammatical combininj^ .ales have 
previously been mastered. Symbols, of 
course, also serve the practical function 
of requiring less .space in printing. None- 
theless, these results suggest that under 
certain conditions it may be well to remem- 
ber that ordinary English can be used to 
teach mathematical ideas.* 

Concrete and Symbolic Learning 

In the preceding section, the to-be-learned 
principles were stated either in mathematical 
symboli.sm or in the English language. Fur- 
ther, the principles taught were applied to 
symbolic stimuli, symbols which were stimu- 
lus instances of one of the arbitrary prin- 
ciples. It was determined that a learned 
principle statement could be applied only 
when the constituent symbols were interpret- 
able in terms of their less abstract symbolic 
referents. Thus, the ability to use the great- 
est integer rule depended on S’s being able 
to compute, say [8.64], having been given 
the meaning of [X] (i.e., take the greatest 
integer in X). In short, ive have so far been 
concerned e.vclusively with symbolic representa- 
tions; no concrete, or even iconic, forms have 
been considered. 

Many stimuli, however, are symbolic 
representations of an abstraction with 
concrete referents. A stimulus such as 

* These results provide .a national basis for making 
one type of branching decision that, while intuitively 
obvious, needs to be made e.xpUclt in computer- 
assisted instruction. Given the objective of learning 
a particular principle .and .an expository mode of 
instruction, one might proceed as follows: (/) test, 
to see if S can m.ake use of the constituent symbols; 
{2) if so, present, the principle in the more elflclent 
symbolic form; {S) if not, present the principle in 
English. 

Although learner feedback h.as long been recog- 
nized as an Important factor in promoting elliclent 
learning, it has been unclear .as what sort of feedback 
to measure. The present resull.'i suggest that specific 
sorts of feedback are needed in order to make specific 
kinds of decisions. 



PSYCHO-MATHEMATICS 



205 



3 5 4 - 7 /' for example, symbolizes 

an abstraction reflecting the structure of a 
variety of more concrete stimulus situations 
— e.g., four stacks of pennies, the first con- 
taining one penny, the second three, the 
third five, and the fourth seven; a figure 
representing the produce of four coun- 
tries, . . Thus, a learned principle state- 
ment is at least potentially applicable to 
concrete stimulus referents. 

Suppose a young child has been taught a 
principle which makes it possible to say 
“16” when shown “1 3 + 5 7” What 

happens when he is pre.sented with the four 
stacks of pennies and is a.sked how many 
there are? The answer to this question 
undoubtedly depends largely on the signifi- 
cance to S of the number symbols (i.e., 
numerals) in the symbolic stimulus. If the 
numerals refer to properties of collections 
of sets, each including a common number 
of elements or objects, and signifies 
combining, positive transfer would not be 
unexpected. If, on the other hand, the num- 
erals and the arithmetic operation of addi- 
tion has been learned entirely without con- 
crete referents, say with flash cards, one 
could feel fairly certain that S would see no 
such relationship. 

Fortunately, this question can be formu- 
lated precisely by characterizing the prin- 
ciples involved in the set-function language 
(SFL). Assume that the principle, cor- 
responding to the symbolic stimulus, has 
been determined, by assessment procedures, 
to be (I = {l, 3, 5, 7}, D = {l, 3, 5, 7}, 
R = { 16 }, 0 = sequential addition). The 
requisite for applying this (symbolic) prin- 
ciple, once learned, in a concrete situation 
is precisely that principle which makes it 
possible to go from the concrete situation 
to the corresponding numbers. A com- 
j)Gsite principle, including this principle, 
along with that corresponding to the sym- 
bolic .stimulus above, might be characterized 
(I = 1 1 penny, . . . , 7 pennies], D = 

R = jl6 pennies], 0 = translate, for all 
X, the property X pennies into the higher 
order (more abstract) property x (the num- 



ber), perform repeated addition, and trans- 
late X back into X pennie.s). Learning the 
.symbolic principle statement, without being 
able to recognize its concrete referents, would 
be like having an egg shell but no egg. 

The relatively simple hierarchical SFL 
analysis proposed provides, I feel, but a 
prelude to the insights which may eventuate 
from similar analyses in other situations. 
Even partial clarification of the relative 
roles of symbolism and concrete referents 
in mathematics learning and performance 
could have important practical as well as 
theoretical implications and is long overdue.* 

Attribute mid Operation Cueing in the 
Discovery of Mathematical Rules 

So far we have limited our illustrations 
and discussion to reception learning; that 
is, learning which takes place via the inter- 
pretation of .symbolic statements.f A good 
deal of mathematics, however, particularly 
in more modern treatments, is taught by 
di.scovery. The typical procedure involves 
presenting, one at a time, either stimuli 
or stimulus-response pairs corresponding 
to the to-be-discovered principle. To de- 
termine whether learning has taken place, 
the learner is usually asked to give the appro- 
priate response to new stimulus instances. 
The former type of situation, in which S 
is tested repeatedly, is exemplified by. 

The sum of the first 2 odd integers, 1 + 
3 = ? 

The sum of the first 8 odd integers, 1 
3 -{■ o = ? 



The learner, of course, would be expected 
to discover that the correct sum may be 
obtained by simply squaring the number of 
terms in the (odd integer) series. 

* This sort of .analysis may also prove useful in 
science. 

t As indicated in the previous section, other vari- 
ants of reception learning are possible, such as 
learning from diagrams or pictures (icons), as in 
geometry, and from concrete objects, such as with 
Dienes’ multiple embodiments.*'^ 
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Ostensibly to speed the discovery process, 
the teacher or auto-instructor might cue 
the critical aspects of the stimuli. Thus, 
underlining the number of odd integers 
(as above) or printing them in red would 
presumably attract attention by making 
the cues more salient. Of course, it would 
be equally possible to identify the appro- 
priate operation (i.e., squaring) or, to intro- 
duce various combinations of both determin- 
ing (D) and operation (0) cues. 

Although a good deal of verbal and non- 
verbal cueing goes on during the contem- 
porary discovery lesson in mathematics, 
there have been very few attempts to un- 
cover the basic mechanisms involved. 
Whereas, recently, there have been a few 
such studies concerned with concept learn- 
ing'®-*’ and, earlier, with problem solving'® 
no consideration has been given to principles. 
This is indeed, unfortunate since principles 
seem to underlie so much of mathematics 
learning. 

With this in mind, the members of my 
research seminar on mathematics learning* 
and I conducted a pilot study to determine 
the effects of verbal attribute and operation 
cueing on the rate of discovering mathe- 
matical principles. In particular, the study 
extended that of Haygood and Bourne'® in 
two ways: (1) principles were used instead 
of concepts and (2) the operations involved 
were arithmetic, rather than logical. The 
study was designed simply to determine 
whether identifying the determining at- 
tributes (i.e., D, the stimulus attributes 
which determine the responses) or the ap- 
propriate combining operation (i.e., 0) 
does, in fact, increase the rate at which 
arithmetical principles are discovered.! 

Method. To minimize the effects of 
individual differences, we again used artificial 
materials. The stimuli were four-tuples 

* Mike Bundrick, John Davis, Rosalie Jensen, 
Bob King, J^rank Pavlick, and Larry Smith. 

t Although the order in which stimuli are 
introduced is also used extensively to promote dis- 
covery, this factor was not considered in this 
preliminary effort. 



of numbers [e.g., (4, 8, 9, 3)] and the re- 
sponses were simply new integers that could 
be derived uniquely from exactly three of the 
four original integers by some combination 
of two (of the four) elementary arithmetic 
operations. 

Three principles were used. The de- 
termining characteristics and operations, 
respectively, were (1) Ai, A3, A4; {2) Ai, 
Ao, A4; (3) Ao, A3, A4; and {1) X Y 
- Z, {2) X • Y -i- Z, {3) X ' Y + Z where 
the subscripts, i = 1, 2, 3, 4, in Ai refer to 
position in the four-tuple and X, Y, and Z 
to place holders. 

A 3 X 3 design, with repeated measures 
on the second factor, was used. Factor 
one involved the type of cue given [none, 
determining attribute (D), operation (0)]. 
The second factor was a composite of the 
principle in question and the order of presen- 
tation* and gave some indication of the 
effects of one discovery on the next. The 
36 elementary education majors were ran- 
domly assigned to one of the three cue groups 
so that a given S was exposed to only one 
type of cue and each S completed three 
discovery episodes. 

The *Ss were told that their job was to 
write that number which they thought cor- 
responded to the four-tuple shown. They 
were also instructed, “There is a procedure 
by which you can always determine the cor- 
responding number when I show you the 
set.” Then, the control group was told, 
“To help you discover this procedure as 
rapidly as possible, you should try to de- 
termine the three specific positions in the 
four-tuple and a rule which combines the 
numbers in these positions to yield the cor- 
responding number.” The attribute group 

* No attempt was made to remove this confound- 
ing, by counterbalancing principles over order of 
presentation, since the study was de.signed as much 
as a learning experience (for my graduate students) 
as one of advancing knowledge. To have gotten too 
deeply involved in the details of design so early in 
their graduate research training could well have been 
self-defeating. I wanted them to think of research, 
first and foremost, as a conceptual experience rather 
than as a composite of technologies. 
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was told, “ . . . determine a rule which 
combines the numbers in the (proper posi- 
tions inserted) to yield the corresponding 
number.” The rule group was told, 

“ . . . determine the three specific positions 
in the four-tuple from which the numbers 
X, Y, and Z are always taken where (proper 
rule in-serted) yields the correct number.” 
After S responded to a given four-tuple, 
either by writing a number or by indicating 
he “didn’t know,” the card on which the 
four-tuple appeared was turned over ex- 
posing the same four-tuple together with 
the correct number response. The *Ss 
were given api)i’oximately 12 seconds to 
compare the four-tuple with its solution 
before the next four-tuple was shown. Ten 
such four-tuples comprised one problem 
and *S’s score was the total number of cor- 
rect responses made. 

The probability of giving a correct re- 
sponse, by chance, without discovering the 
corresponding principle was relatively small. 
This was evidenced by the fact that once a 
correct response was given, S almost in- 
variably gave the correct response thereafter. 

Results. The results are summarized 
in Table II. 

Both attribute and operation cueing in- 
duced significantly (p <0.01) earlier discov- 
ery of all three principles. But, whereas a 
significant improvement, presumably due to 
practice, was noted between problems two 
and three (p < 0.01), there was essentially 
no difference in performance on the first 
two problems. This is surprising since so- 
called “warm-up” effects typically have their 
greatest effect in the beginning. 

More intensive comparisons, however, 
indicated that problem one practice signifi- 

TABLE II 







Problem 




One 


Two 


Three 


Attribute 


6.6 


4.2 


8.1 


Operation 


3.5 


7.2 


8.7 


Control 


1.1 


0.6 


4.0 



cantly (p < 0.01) improved operation group 
performance on problem two but actually 
hindered (significantly, p < 0.01) attribute 
group performance — classic cases of positive 
and negative transfer. At least two pos- 
sible interpretations may be given for the 
latter finding. First, having discovered 
that .some combination of addition and sub- 
traction (Ai + As — A4) worked on prob- 
lem one, many of the attribute-cue *Ss 
may have spent too much time trying various 
combinations of addition and subtraction 
on problem two. Second, the rule, Ai 
• Ao A4, needed to solve problem two, 
since it involved multiplication and division 
as opposed to addition and subtraction, 
may have been intrinsically harder than 
that needed on problem one. Of course, 
both interpretations may have some degree 
of truth. Having been given the appro- 
priate rules, in both cases, the rule-cue <Ss 
were not subject to such effects. Further- 
more, whatever response set developed on 
the basis of discovering that positions 1, 3, 
and 4 were relevant on problem one was 
more than compensated for by the practice 
afforded. 

Clearly, reseanih, aimed specifically at 
such questions, is needed to provide defini- 
tive information. It is impossible to say, 
at this time, that we fully understand exactly 
what is involved or, even more important, 
what the boundary conditions for these 
findings are. 

Since specifying boundary conditions has 
all too frequently been passed over or, at 
most, been paid ambiguous lip service in ed- 
ucational research, I should like to emphasize 
one point about these results. It is doubtful 
that attribute cueing will ever be shown to be 
unconditionally better than operation cueing 
or vice versa. What future research may be 
expected to do, however, is to specify the 
conditions under which each will be superior. 

An Analysis of Learning Mathematics 
by Exposition and Discovery 

With the machinery and body of data 
built up, we are now in a position to provide 
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a fairly detailed analysis of learning by 
exposition and discovery. Because of space 
limitations, I shall limit my analysis to the 
question, “/s it better to learn by exposition 
or by discovery and, if there is no unique answer 
to this question, what are some of the conditions 
under which a particular method will be 
better?” In this regard, only the contention 
that learning by discovery enhances the 
learner^ s ability to solve new problems will be 
considered. Before attempting to answer 
this question, let me reemphasize the dif- 
ferences between reception learning and 
learning by discovery. 

Reception learning involves the encoding 
of information presented directly to the 
learner. Usually, but not necessarily, such 
information is presented in statement form. 
Understanding is said to occur, if having 
learned a statement so that it can be “par- 
roted back’’ on cue, S is also able to use it to 
successfully determine the responses to the 
specified set of stimuli. As formulated in 
the SFL, S is presented with a statement 
that may be put in the form, “If I', then 
O'(D') = R'.” Then, having memorized 
the statement, S is tested >n stimuli ar- 
bitrarily selected from the domain (i.e., 
the set of stimuli) of the corresponding 
denotative principle, {(S<, R/)|fel}. 

Learning by discovery, on the other hand, 
requires that the learner abstract the com- 
mon principle from a subset of the S-R pairs 
in the set {(S<, Ri)|fel(. He must, there- 
fore, identify the determining characteristics, 
D, the operation, 0, and, if required to 
discriminate between exemplar and non- 
exemplar stimuli, he must also determine 
the identifying characteristics, I. In effect, 
while reception learning presupposes the 
ability to interpret and apply that informa- 
tion represented by the symbols I', D’, 
O', R', learning by discovery requires the 
ability to identify, in a nonverbal fashion, and 
to use I, D, 0, and R. 

Unfortunately, most existing studies com- 
paring exposition and discovery have little 
to say about whether discovery enhances 



the ability to solve new problems as is so 
often proposed by pedagogical enthu- 
siasts.’®-^ Suppose, as is often done, the 
exposition group is presented directly with 
a rule or principle whereas the discovery 
group is simply presented with stimuli, 
to which the principle relates, and required 
to determine the responses.* Presumably, 
this is accomplished by discovering the 
associated principle. The two groups are 
then frequently tested with the original 
stimuli (e.g., problems), new stimuli within 
the domain of the principles involved, and 
stimuli requiring new principles for their 
solution. What is gained by comparing 
the expository and discovery groups on 
several measures, however, is lost in another 
way. Whenever differences in performance 
on within-scope problems are found, com- 
parisons on any more general transfer tests 
will necessarily be inconclusive. Thus, ob- 
tained differences in the ability to obtain 
solutions to new problems, based on new 
principles, might equally well be due to 
differences in original learning as due to 
any advantages inherent in a method itself. 
In short, such studies can never hope to 
determine whether discovery methods ac- 
tually improve the learner’s capacity for 
dealing with new problems. 

To make a valid comparison of the sort 
described, exposition and discovery groups 
must be equated on original learning. 
Suppose experimental *Ss are set with the 
task of learning several principles. In 
this case, the appropriate tasks on which 
to equate exposition and discovery groups 
are those which require the ability to apply 
the principles taught (or discovered). As- 
suming the necessary equivalence, tasks, 

* It is not uncommon in such studies, in fact, 
rather typical, for the rule-given group to perfoun 
better on within-scope problems. The symbolism 
study, described above, however, demonstrates 
quite clearly that this need not necessarily be so. 
If the constituent symbols are not familiar to the 
learner, his perforr^ance on within-, as well as e.xtra-, 
scope test problems may be e.xpected to be uniformly 
poor. 
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requiring new principles for their solution, 
may be posed to compare the groups as to 
the ability to deal with new problems (based 
on new principles). 

In order to predict how exposition and 
discovery groups would fare on such tasks, 
explicit consideration mu.st be given to what 
is required of the learner in the two situa- 
tions and the nature of the transfer prob- 
lems. Discovery /Ss must learn how to derive 
principles (those involved in the learning situ- 
ation) in order to achieve criterion; exposi- 
tion /Ss may not. Making predictions, then, 
really boils down to ivhat discovery Ss learn 
that exposition /Ss might not. It likely that 
discovery Ss, in attaining criterion, may dis- 
cover a derivation principle by which new 
principles, similar to those originally learned, 
can also be derived. In this case, discovery 
/Ss might be expected to perform better than 
expository /Ss on tasks which can be solved 
via principles within the scope of the deriva- 
tion principle discovered. On the other 
hand, discovery /Ss would probably have no 
special advantage on problems, requiring 
for their solution, principles beyond the 
scope of this derivation principle. 

Suppose, for example, that exposition 
and discovery groups are set with the task 
of determining the sums of number series 
— long series so that it is infeasible to per- 
form sequential addition. Further suppose 
that two formulas (i.e., rules), say and 
n^ + n, will suffice for aU of the learning 
series. The discovery Ss, of course, would 
have to discover these formulas whereas they 
would simply be presented to the exposition 
/Ss. Assuming equal mastery by the two 
groups, one would make the following pre- 
dictions. If the formula for obtaining the 
sum of a new transfer series may be derived 
in the same manner (i.e., via the same deriva- 
tion principle) as the two original formulas, 
then the discovery /Ss may be expected to 
demonstrate superior ability. If not, no such 
difference may be expected. 

Consider a second example in which the 
Ss are to demonstrate their ability to, say, 
construct one of those models corresponding 



to each of two finite geometries.* Suppose 
the corresponding axiom systems are: 

A. 1 there exists at least one point. 

2 each point is on exactly two lines. 

3 each line contains exactly two points. 

4 two points determine at most one 
line. 

5 there are no two lines not having 
a point in common. 

B. 1,2, 3, and 4 identical to system A. 

5 to each line, there corresponds 
exactly two lines which do not have 
a point in common with it. 
then, the models 




and 




correspond to systems A and B, respectively. 

The task posed might be viewed as in- 
volving two discrete principles, each having 
a denotation consisting of one stimulus (i.e., 
list of axioms) and one response (i.e., model). 
If the exposition /Ss are presented with the 




models directly while the discovery /Ss 
are required to derive them, it would not be 
surprising if the discovery /Ss could derive 
the model 



II ■ 

* It is assumed, of course, that referents, such as 
and have been assigned to the unde- 
fined terms. 
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for the system, 

C. 1, 2, 3, and 4 identical to systems A 

and B. 

5 to each line, there corresponds 
exactly one line which does not 
have a point in common with it, 

while the exposition /Ss could not. On the 
other hand, an axiom system of the form, 

D. 1 there exists exactly nine points, 

2 each line contains at least three 
points, 

3 ..., 

might well pose equivalent difficulties for 
both groups. 

It appears, on analysis, that these experi- 
ments were rigged in favor of discovery. 
The discovery /Ss were required to learn a 
principle(s) for deriving solutions; the ex- 
position were not. Of course, nature is 
not always as simple as we would like so 
that it is, indeed, highly encouraging at a 
well-controlled study by Gagne and Brown 
provides strong support for the essential 
correctness of the analysis proposed. Learn- 
ing by discovery does, in fact, seem to im- 
prove ability to solve new (presumably 
within-scope) problems when original learn- 
ing is controlled. 

Nonetheless, consider what happens when 
we bring guidance into the discovery situa- 
tion. Hints might be given, for example, 
to cue the determining attributes, D. They 
might also be used to direct the learner 
towards the appropriate combining opera- 
tion, 0; or, towards I. In fact, the Ss might 
be given all of this information directly. 
But, then, is not this something very much 
akin to teaching by exposition? It would 
appear to be at least theoretically j)ossible to 
present derivation principles in an expository 
manner rather than to depend on discovery. 

So far, I have avoided the $64 question- 
just what are derivation principles and can 
they be stated in expository form? It turns 
out that derivation rules can be specified for 
the illustrations cited. The rule for deriving 
formulas for obtaining the sum of any arith- 



metic series can be stated, “Write above and 
below each term of the series, respeetivcly, 
its position number (n) and the cumulative 
sum of the series through that term of the 

71 

series (S), consider the sequence of ratios, 

n 

S/n, and express (via induction) the general 
term of this sequence, as a function, f{n), of 

n n 

n, then S/?i = f{n) so S = n-f{n) from 
which the sum of the (presumably too long 
to add) series may be determined by substi- 
tution.”* 

A still simpler derivation principle would 
work in the case of the finite geometries 
example. Let the determining attribute 
be the number of lines not having a point in 
common with a given line (axiom 5). The 
combining operation, 0, would be that 
mapping which takes this number into a 
regular polygon with three more sides than 
the number in question. Whether this 
specification concurs with what the reader 
feels that a mathematics student should learn 
in such a situation is not the point. I, too, 
would be unhappy if this is all he learned. 
The point is that he could solve the problems 
posC J, armed only with the simple derivation 
rule indicated. t 

As suggested by the results of the sym- 
bolism study, being able to specify and state 
a higher order derivation rule in verbal 
(i.e., symbolic) form may not always be 
sufficient to insure learning. Among other 
things, the language used must be inter- 
pretable by the learner. S might not, for 

* This derivation procedure closely parallels that 
used in Gagne and Brown’s discovery treatments.^*^ 
The author is indebted to Robert M. Gagne for 
making his original materials available for analysis. 

Such formulas, or better still the sums, might also 
be derived directly from the formula [(A + L)/2]N. 
where A refers to the first term of the series and L, 
the last. To correspond to this “higher order” 
exposition treatment, of course, a discovery treat- 
ment would have to be constructed so as to guide the 
learner into discovering some equivalent of [(A + 
L)/2]iV. 

t One way to get clo.ser to a “more desired” sort of 
learning may be by imposing additional performance 
restrictions. See the next section. 
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example, know to what the symbol “S’’ 
refers. Unle.ss the derivation rule can be 
specified in a symbolic form that the learner 
can correctly interpret, there is no alterna- 
tive to learning by discovery. If, on the 
other hand, a comprehensible .symbolic 
(or iconic) representation can be given, the 
question of whether exposition or discovery 
is to be preferred will depend on factors, 
other than immediate performance capa- 
bilities, such as learning efficiency and 
retention— two questions to which we have 
not addressed ourselves. 

In many other cases, of coui'se, it may be 
impossible to identify an appropriate deriva- 
tion principle due to the complexity of the 
situation. Under these circumstances learn- 
ing by discovery would be indicated. 

Insofar as performance capabilities are 
concerned, this analysis has not, as yet, 
been tested. The empirical results obtained 
in the “real world” may, as they often do, 
indicate inadequacies in the analysis. Wil- 
liam Roughead and I have an experiment 
underway which hopefully will help provide 
further clarification. 

Summary and Concluding Remarks 

Let me first summarize briefly the basic 
characteristics of the set-function language 
(SFL). The principle, rather than the 
association, is assumed to be the basic be- 
havior unit. Principles were defined as 
units of acquired knowledge which make it 
possible to give the appropriate response, in 
a class of responses, to each stimulus in a 
class of stimuli. In effect, the denotation of 
a 'principle (a function) is a set of ordered 
S-R pairs such that to each S exactly one R 
is assigned. If the responses in the denota- 
tive set are all identical, then the resulting 
principle is commonly called a concept. If 
only one S-R pair is involved, an association 
results. 

The essential nature of any principle 
(i.e., internalized element of knowledge) 
can be characterized by an ordered four- 
tuple (I, D, 0, R) where I refers to that set 
of stimulus properties (or dimensions) in- 



dicating when the principle is to be applied, 
D refers to the set of stimulus properties 
(or dimensions) from which the response 
dimension, R, is derived, and 0 is the com- 
bining operation or rule by which R is 
derived from the properties in D. In the 
case of the association, the properties in 
I and D are not only identical but may be 
viewed as simply referring to the stimulus 
itself. In concept learning, the correspond- 
ing properties refer to classes of stimuli. 
In principle learning, the properties refer 
to dimensions having values which are prop- 
erties of a class of stimuli (e.g., X is a di- 
mension referring to the numbers 2, 5, . . .). 

To emphasize the distinction between 
the four elements characterizing a principle 
and symbolic representations of these ele- 
ments, the latter are represented by primes. 
Any principle can be stated in the form, 
“IfF, thenO'(D')=RV’ 

Let me reemphasize why I feel that some 
such language is needed. Much of the 
research in mathematics education, to date, 
has been of a fragmentary nature. A 
fundamental reason for this situation is that 
the wrong variables have too often been 
studied. By selecting easy to specify pheno- 
typic variables, theory development has 
been made unnecessarily difficult. Although 
chronological age, for example, has been 
demonstrated to relate to a wide variety 
of behavioral phenomena, it is not an under- 
lying cause. Time does not cause anything 
but only provides an occasion for things to 
happen. Age may, however, covary with 
underlying causes — e.g., the prerequisites 
for a given bit of learning. Integrative 
theories, leading to a more profound under- 
standing of mathematics learning and teach- 
ing, can only be hoped for if more emphasis 
is given to the study of such underlying 
(genotypic) variables. Mathematics learn- 
ing depends so critically on what the learner 
already knows and can do, what information 
is presented, and the test problems proposed 
that it seems unlikely that any profound 
understanding will eventuate from research 
which does not take into account the re- 
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lationships involved. Thiss will, of (*our.se, 
include a careful analysis of the mathe- 
matics itself, but cannot be limited to it. 

A precise language, like the SFL, may 
not only help to guide the researcher in 
his quest for new and hopefully important 
questions but may be of even more help 
in formulating his questions in researehable 
form. Such a language may also help guard 
against the inappropriate generalization 
of research findings. One of the most im- 
portant limitations of educational re.seareh 
has been that so little attention has been 
paid to determining the boundary conditions 
under which certain results will and will 
not obtain. Equally important, the SFL 
helps fill the gap between the highly con- 
trolled studies of the learning laboratory 
and the niOiu encompassing but less well 
specified research of the mathematics edu- 
cator. If nothing else, it may set the 
psychologist to wondering whether he has 
been studying the right things and the 
mathematics educator to searching for 
better and more precise means for coming 
to grips with his research problems. In 
particular, it is hoped that some guidelines 
will have been provided for the relatively 
inexperienced investigator and some food 
for thought for the active research worker in 
'psycho-mathemalics. 

In spite of its relatively rigorous founda- 
tion the SFL has been used sparingly in the 
analyses and research described above. In 
relatively few cases was the language used 
formally. Such sparing use, however, is 
not unusual for scientific languages. Chem- 
ists, for example, refer to structural models 
only when needed to clarify difficult points; 
experimental psychologists use the S-R 
symbolism in a similar fashion. The im- 
portant point is that by having a precise 
language to resort to when needed, fewer 
misunderstandings and more continuity 
of effort are to be expected. 

Needed Research 

Do I think that the SFL, in its present 
form, is sufficient to deal with most mathe- 



matics learning? Hardly! In the first 
place, many of the analy.ses proposed have 
not been put to experimental test. As 
we have seen in the study on rule generality 
a seemingly logical analysis of the situation 
is not necessarily fully reflected in the 
learner's performance. Psychology, in 
effect, is not logic and logic is not psychology. 
What frequently happens is that an analy.sis 
points up new questions, these questions are 
put to behavioral test, and the results in- 
dicate discrepancies in need of further 
analysis. This is the method of behavioral, 
nay any, science. 

More important, in my quest for depth, 
I have had to sacrifice breadth. Let me 
list what I see as some of the most crucially 
needed research. First, no attempt has 
yet been made to extend this approach to 
deal with instruction which simultaneously 
involves several objectives.^® Increasing 
the number of behavioral objectives im- 
poses additional constraints on the nature 
of the learning to be achieved. The acquired 
knowledge must make it possible to perform 
successfully on a variety of tasks. 

Second, complex problem solving and 
proving theorems, in which there are several 
stages, have not been considered. Examples 
have only been given for those cases in which 
the combining rule, 0, was effectively an 
algorithm. Nothing was said about heu- 
ristics, those procedures which mi{iht, but 
will not necessarily, lead to a solution. As 
the reader will no doubt have noticed, even 
the earlier references to constructing models 
of finite geometries was reduced to a rather 
benign one-step principle. 

Thiid, the question of whether the SFL 
can be used to deal with highly complex 
forms of interrelated knowledge is still 
open. A good deal of mathematical per- 
formance is apparently based on the acquisi- 
tion of internalized models or images. On 
the basis of such knowledge, the learner 
can frequently respond appropriately to 
questions concerned with a whole class of 
principles rather than simply one. The 
problem is further compounded since, as 
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Easely2® has argued so pervasively, there 
are important relationships between the 
various levels of mathematics learning: (f) 
concrete models, (2) the mathematical 
theory, itself, and (3) a suitable set of rules 
governing the syntax of the mathematical 
theory (i.e., a suitable logic). Can such 
learning be represented in the SFL and, if 
.so, how? 

Fourth, the SFL says little or nothing 
about efficiency or time to learn. Here, 
as was suggested in the last section, is where 
learning theories must play an important 
role.” 

These are just four of the critical problems 
that need to be solved if a precise theory of 
mathematical learning is to be invented.* 
How they are to be solved, of course, is not 
entirely clear. Perhaps the procedures of 
task analysis^* may be extended to deal 
with multiple objectives. Composite prin- 
ciples (i.e., functions) could well prove useful 
in dealing with problem solving and proving 
theorems, the component principles being 
used to represent individual steps in multi- 
stage processes where the effective stimulus 
is continually changing. Even heuristics 
may, in fact, turn out to be nothing more 
than sequences of algorithms. 

Whatever the truth may be, we will 
never know unless many more mathematics 
educators are attracted into the prenatal 
field of psycho-mathematics. 

This research was supported in part by the U.S. 
Office of Education. 
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